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Preface 


This book presents an introductory account of state space analysis of control 
systems and furnishes a background necessary for studying modern control 
theory. It is primarily designed to be a textbook for a second course in the 
field of feedback control systems for seniors and first year graduate students 
majoring in the field of engineering. In addition to class-room use it will 
prove to be useful as a reference book for practicing control engineers who 
wish to study basic theory of state space analysis of control systems and to 
get some working knowledge of modern control theory. The prerequisite on 
the part cf the reader is that he has had at least one course on feedback 
control systems at the junior-senior level. 

Most of the material presented in this book has been offered in the 
author's courses at the senior and first year graduate level in the Depart- 
ment of Mechanical Engineering, University of Minnesota. 

The first five chapters present essentially a background material and the 
last four chapters give main subjects of state space analysis of control sys- 
tems. Specifically, Chapter 1 presents the introductory material. Chapter 2 
reviews basic aspects of matrices and vectors. Chapter 3 through Chapter 5 
give in sufficient detail a basic mathematical background necessary for state 
space analysis of control systems. Because modern control theory is based 
on theory of ordinary differential equations, a detailed treatment of the 
vector matrix differential equation is given in Chapter 6. Chapter 7 discusses 
controllability and observability of control systems. Chapter 8 presents 
stability analysis of linear systems and nonlinear systems. Finally, Chapter 
9 presents analytical techniques for designing optimal control systems. At 
the end of each chapter numerous worked-out problems are provided. They 
supplement the development of theory and illustrate techniques presented. 
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Introduction 


1-1 INTRODUCTION TO STATE SPACE CONCEPTS 


The great progress in control science since about 1955 has changed basic 
concepts of analysis and synthesis of control systems. This progress has 
depended largely on mathematical study of optimal control systems. 

Modern control theory which is based on state space concepts is ex- 
tremely useful not only for designing a specific optimal control system 
but also for improving the principle on which the system will operate. 
By using the state space approach the control engineer may be able to 
design systems with performance characteristics that cannot be achieved 
by the classical approach by means of the frequency response method or 
the root locus method. 


Plant and control system A physical object to be controlled 
is called a plant. It may be a heating furnace, a chemical reactor, a space- 
craft, etc. A combination of components that acts together and performs 
a certain objective is called a system. A control system is a system which 
consists of such components as a sensor, controller, actuator, plant, etc. 
A plant is usually a given fixed component of a control system. 


Control of complex systems The specification of the control 
signal over the operating time interval is called the control law. Control 
system design based on modern control theory rests on determination of an 


optimal control law which minimizes a given performance index subject 
to certain constraints. A performance index is a quantitative index of per- 
formance. It is a measure of deviation from the ideal performance. Except 
for simple cases, optimal control of a plant requires a digital computer 
which utilizes feedback signals from the plant to produce the optimal control 
signal. Figure 1-1 shows a block diagram of a computer control system. 


2 Controller 


B Controller 


Computer 


Figure 1-1 Computer control system. 


The computer performs two functions: (1) producing an operational guide 
by use of the data from the plant; (2) computing, based on the operational 
guide, the command signals to the controllers which control various inputs 


to the plant. The command signals from the computer adjust the set points - 


of the controllers. Computations of the command signals which are optimal 
are based on the mathematical model of the plant and the given performance 
index. In some cases, output signals of the computer are used to adjust the 
parameters of the controllers. (This is an adaptive control system.) The set 
points of the controllers are adjusted by the command signals which may 
be produced by the same or a different computer. Figure 1-2 shows a block 
diagram of such a computer control system. In either system a computer is 
utilized in order to minimize the given performance index, or to maximize 
"profit" or "efficiency," etc. of the system. 

The block diagrams shown in Fig. 1-1 and Fig. 1-2 represent examples 
of control systems that will yield superior performance. Such control systems 
can be synthesized only by use of modern control theory based on state 
space approaches. The use of state space concepts facilitates analysis and 
synthesis of complex control systems in a unified manner. In synthesizing 
an optimal control system by using the state space technique, we need 
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Figure 1-2 Computer control system. 


solve only analytical aspects of the problem. A digital computer may be 
programmed to handle necessary numerical computations. 


Mathematical models The first step in the analysis and syn- 
thesis of a control system is the development of a suitable mathematical 
model of a given plant. A mathematical model of a plant can, in principle, 
be obtained from the structure of the plant and the properties of the indi- 
vidual elements of the plant, together with fundamental physical laws. 
Developing a suitable mathematical model of a plant is, except in simple 
cases, generally a difficult problem and may lead to serious error. Difficulty 
in developing mathematical models arises because modern plants are, in 
general, extremely complex. In addition, sufficient amounts of operating data 
may not be available. Note that an oversimplified mathematical model may 
eliminate intrinsic characteristics of the plant; an overcomplicated mathe- 
matical model will cause mathematical difficulty. A suitable model is a 
compromise between the mathematical difficulty attached to complicated 
equations and the accuracy desired in the final result. 

Equations describing the plant dynamics can often be approximated by 
ordinary differential equations of order n with time ¢ as the independent 
variable. In this book we treat only systems that can be represented by 
nth-order ordinary differential (or difference) equations or sets of n first-order 
ordinary differential (or difference) equations. 

An nth-order ordinary differential equation is called /inear if each term 
of the equation contains at most only first power of the dependent variable 
or its derivatives. Systems described by linear differential equations are 
called /inear systems. 


4 INTRODUCTION 


Linear systems can be divided into linear time-invariant systems and 
linear time-varying systems. A system is time-invariant if its characteristics 
do not change with time. Most physical systems are time-varying. If the 
changes in the system's characteristics are very slow in comparison with 
variations in the input, a linear time-varying system can be approximated 
with sufficient accuracy by a linear time-invariant system. 

A differential equation is called nonlinear, if it is not linear. Systems 
described by nonlinear differential equations are called nonlinear systems. 

Any physical system, if analyzed in great detail, is nonlinear. Some 
physical systems can, however, be approximated with sufficient accuracy 
by linear systems. If the deviation from linearity is so small that it is not 
important for the specific problem considered, then we can treat the system 
as linear. Thus, whether a system can be considered linear or nonlinear 
depends heavily on the accuracy desired in the solution. 

The difficulty of analyzing and synthesizing a system depends closely on 
the type of differential equation which describes it. Generally speaking, 
analysis and synthesis of a system become much easier if the system's 
differential equations are linear and time-invariant. Because such linear 
time-invariant differential equations represent, in an idealized sense, the 
behavior of many physical systems we shall treat such equations in great 
detail in this book. 


Reduction of system equations (/-/)' A complex plant is 
usually characterized by multiple inputs, a few to more than ten of major 
importance, and a similar number of plant outputs. A reasonably accurate 
and complete description of the plant will require several to over a hundred 
first-order differential equations. Reducing these to the essential equations 
by hand is laborious even in the simplest case and is impractical in most 
cases. Without reduction, we may not be able to study a complex system 
on an analog computer because of difficulties with algebraic loops, even if 
the machine is large enough. A digital computer can be programmed to 
handle a complex system but even here a reduction to only the essential 
equations is desirable to save storage space and thus computing time. The 
state space formulations provide an ideal means of making this reduction 
in a systematic manner with no loss of essential information. 


State and state space A state of a dynamic system is the 
smallest collection of numbers which must be specified at time ¢ = t, in 
order to be able to predict uniquely the behavior of the system for any 
time ¢ > t, for any input belonging to the given input set, provided that 


"INumbers in parentheses refer to the list of references at the end of this chapter. 
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every element of the input set is known for t > to. Such numbers are called 
state variables. The input set here is defined as the set of all possible inputs 
that can be applied to the system. The state of a system at time £ is uniquely 
determined by the state at time 7, and the known input for t > to, and is 
independent of values of the state and input before to. 

Suppose that at least n state variables x,, X», ..., Xn are needed to 
describe completely the behavior of a given system. Then the set of n state 
variables can be conside:ed as 2 components of a vector x. Such a vector 
x is called a state vector. A state space is defined as an n-dimensional space 
in which x,, Xs, ..., x, are coordinates. The state at time / of a system 
defined by n first-order differential equations can then be represented by 
a point in an n-dimensional state space. Note that in the state space approach 
of analysis and synthesis of control systems we deal with a set of n first-order 
differential equations rather than a single nth-order differential equation. 

A point in a state space at time ¢ is called a representative point. The 
locus of representative points for a time interval f, « 1 «t, is called a 
trajectory for this time interval. 


Review of classical theory of dynamics State space concepts. 
fundamental in modern control theory, evolved from the classical theory of 
dynamics of particles and rigid bodies. We now give a brief review of this 
classical theory of dynamics. (1-2) 

In investigating dynamics of particles or rigid bodies where constraints 
exist, it is advantageous to specify the positions by means of independent 
variables rather than to denote the positions of particles or rigid bodies by 
rectangular coordinates, because not all rectangular coordinates are linearly 


independent. 
Consider a system consisting of r masses, m, Ma, ..., m;. Let us define 
coordinates of r masses as (x, Xs, x4), (X4, Xs, Xe), - a Carca Xs, X37) and 


define forces acting on r masses as (X,, Xo, X3), (Xs, Xs, Xo), -- +> (Xar-2 
X,,.,, X3,). Then Newton's equations of motion become 


m;X, = X, (1:1, 2;....,37) (1-1) 


where m, = m, = my, m, = M; = me, ..., Myr-2 = M3;_, = ms. Equa- 
tions (1-1) represent motions of r masses in the absence of any constraints. 
If there exist the following s constraint equations: 


VAENE TAREE = 0 G = 1,250 558) 


then clearly we can define a set of 3r — s linearly independent variables 
qis do...» Ge (Where k = 3r — s) to express Xi, Xs, -<=> Xsr, OF 


Xi Sis codi) (11,2;....,3F) 


Such linearly independent variables q,, g2,---, gx are called generalized 


coordinates. 'The number of generalized coordinates is equal to the number 
of degrees of freedom of the system. In the present system the number 
of degrees of freedom is k = 3r — s. A knowledge of the magnitudes of 
the generalized coordinates enables us to determine the positions of the 
particles or rigid bodies at any given instant. 

Lagrange's equations of motion of a system are differential equations 
in terms Of g,, Go, - . - 59x, Which are obtained by use of Hamilton’s principle.! 
To simplify our discussion let us consider a conservative system without any 
forcing function. Lagrange showed that the equations of motion of a con- 
servative system can be given by 


d (aL ðL ER ET, 5 ! 
a (ax) - $e 9 (eqs. x) (1-2) 
where LT. 


T — kinetic energy 

V = potential energy 

q, — generalized coordinates 
Since L = L(q,, 92, ~ + ++ Gus Gis Goo - -., 4x), Eq. (1-2) reduces to a set of k 
second-order ordinary differential equations in q,, go, ..., gs. 

Hamilton showed that the equations of motion can be simplified if we 

define new coordinates p;, called the generalized momentum coordinates, 
such that 


pic (L-—12,...,K) 


in terms of p, and q;, the equations of motion can be reduced to a set of 2k 
first-order differential equations of the form: 


> ð 
i (i = 1:2: 055-56) (1-3) 
pore 
Pi 2g. 
where H-T-4-* 


[For derivation of Eq. (1-3), refer to Prob. A-1-4.] Equations (1-3) are 
called Hamilton’s equations of motion.* The function H is called the Hamil- 
tonian function. It is a known function of q; and p;(i= 1,2,...,K). 

For a system with k degrees of freedom a 2k-dimensional space in 
which qi, q» ..., qx and p,, ps, ..., p, are coordinates is called a phase 
space. A representative point in the phase space moves about the phase 
space, as both coordinates and momenta change with time. Hamilton's 


1See Prob. A-I1-1. 

*In solving problems of vibratory systems, Lagrange's equations are mor? convenient 
than Hamilton's. For theoretical investigation of dynamic systems, however, the latter 
are more convenient. 


equations (1-3) give the components of the velocity of the representative 
point in the phase space. The problem of classical dynamics of particles 
and rigid bodies is to investigate the path of the representative point in the 


hase space. ; ! : 
3 Notice that Eq. (1-3) is a set of 2k first-order differential equations 0o: 


the following form: 


ji = fC Gus d» +> Us Pis Po + +» Pk) (1-4) 
Di = gd do de Pis Pose + +> Ped) 
TE i ae | 


In the system of Eq. (1-4) the 2k variables q,, de ns dk; 
PE EREET describe completely the dynamics of the 
system. Since these 2k variables are the smallest collection 
of numbers which must be specified in order to describe N 
the system dynamics completely, they are state variables. 

Asan example consider the simple pendulum shown in | 
Fig. 1-3. In considering this system it is not convenient to 


i m 
use ordinary rectangular coordinates x and ys because g 
there is a constraint equation, x° + yolk. It is conveni- Figure 1-3 
oordinates 6 and r, in which r is Simple 
ent to choose polar c ve MN 


constant. Thus the only variable of the system or the 


generalized coordinate is 6. ' s neg - ^ 
In this system the mass m has risen from its equilibrium position a dis 


tance /(1 — cos 0). The IRL is therefore given by 
y —ngl(1 — cos 8) 
The velocity of the mass m is 16. Hence, the kinetic energy is given by 
T= ZY 
Let us put 6 = q. The Hamiltonian function H is given by 
Herp y= A ày + mgl(1 — cos q) 


Since p is defined by 


H can be written as 


C» - 1(1 — cos q) 
H = ym? + mgl( q 


From Eq. (1-3) 


p= uy 
p ml? (1-5) 
p= 22H —mgl sin q 
q 
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Equations (1-5) define the motion of the pendulum in the phase plane in 
which g and p are the coordinates. Here q and p completely describe the 
dynamics of this system. They are a set of state variables. 

Phase spaces have been used in the classical theory of dynamics to 
describe dynamics of systems consisting of particles or rigid bodies. For 
a system with k degrees of freedom the phase space is a 2k-dimensional 
space in which q;, d», ..., qx and pi, ps, . .. , p, are coordinates. Hence for 
a onc-degree-of-freedom system, the phase space becomes a phase plane, 
a two-dimensional phase space. For a two-degree-of-freedom system, the 
phase space is four-dimensional. 

The concept of degrees of freedom as used in the classical theory of 
dynamics may not apply to control systems. Hence the definition of the 
phase space previously given may not be directly applicable to control 
systems. Concepts of phase space can, however, be directly applied to 
control systems. In fact, concepts fff state space are evolution of concepts 
of phase space. Note that defirlitions of the state space used in modern 
control theory are more general than the definition of the phase space 
used in the classical theory of dynamics. (The "three-dimensional phase 
space" frequently used in the analysis and synthesis of optimal control 
systems actually means a three-dimensional state space.) 

This completes our brief review of phase space concepts used in the 
classical theory of dynamics of particles and rigid bodies. 

in the next section, we present a mathematical background necessary 
for subsequent discussions on state variables and state spaces. 


1-2 MATHEMATICAL BACKGROUND 


In this section we shall first discuss representation of system differential 
equations in terms of vector matrix notation. We shall next review existence 
and uniqueness theorems concerning solutions of differential equations. 
These theorems form a basis for precisely assigning a set of state variables 
to a given system with a given input set. 


Representation of system differential equations in terms of vector 
matrix notation Systems which we shall consider in this book are assumed 
to be described by ordinary differential equations of order n, or by a set 
of n first-order ordinary differential equations. 

We shall first show that a differential equation of order n can always be 
reduced to a set of n first-order differential equations. (We shall give detailed 
discussions on this subject in Chapter 4.) Consider the following differential 


equation of order n: 
(n) (n-1) 


xm (x Xs ey X ut) (1-6) 
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where u is the forcing function. By defining new variables X,, Xs, -<--> Xn 
such that 
xQ-— 
X;-3X 
(n-1) 
Xn = X 
Eq. (1-6) can be reduced to 
X, X 
X = Xs 
: (1-7) 
Xn-1 = Xn 


dose Ne Nasr no ho iE) 
i - i ial equations. 

Equations (1-7) are a set of n first order differentia 
* We shall next consider a more general case. Consider the control system 
shown in Fig. 1-4. Equations of the plant may be adequately described by 
a set of n first-order ordinary differential equations of the following form: 
die = fitu Mas cR EM s eira 1) (1-8) 

(i = 15:25 cove n) 


where X,, Xs, ..., Xn are state variables, and u, Us ---> 
variables. It is noted that the state variables x, are not necessarily d 
able. We may have to measure something else (usually, system outputs) an 


u, are control 


Output 


Element 


State Output 


ntrol 
sopira Variables 


Variables Variobles 


Figure 1-4 Block diagram of control system, showing terminology. 


wen estimate what the actual state variables are. We assume that the system 
outputs y,, ys, ..., Ym are directly measurable and are related to the state 


variables x,, X», .. . , x, and control variables u,, u.,..., u, by the following 
equations: 


= Bi Rin Nea cn cy E. uy, Uzee., U75 t) (1-9) 

(J = l2 m; s n) 
If the order of the system is other than small, such a set of equations 
becomes notationally complicated. It becomes necessary to use vectors and 


matrices to simplify the notation.! Rewritten with vector notation, Eq. (1-8) 
and Eq. (1-9) become respectively 


X — f(x,u, t) (1-10) 
and 
y = g(x, u, t) 
where x, u, y, f(x, u, 1),and g(x, u, t) are vectors defined by 
Xi u, Ji 
X5 Ur PE 
x = » Hn s » s y — 
Xn ur Ym 
Ax, u, t) gx, u, t) 
TAX, u, t) g(x, u, t) 
f(x; u,1):-— = 7 g(x, u, t) = 5 
f(x, u, t) ga (X, U, t) 


Here vectors x, u, and y are called respectively the state vector, the control 


vector, and the output vector. The system is specified by the vector-valued 
functions f and g. 


If the system shown in Fig. 1-4 is linear in x and u, a set of n first-order 
differential equations may be written as 


X, = ay(t)x, + as(t)xs + +++ + Ayalt) Xn bu +>- + by) 
x, = ay,(t)x, + ds (t)Xs ++ F ant )Xn + baltu, + ee + ba(t ju; 


Xn = An (t)xX, + as(1)xs + +- + Ann(t)Xn + ba(t)u, +--+ + bnt Ju; 


and m algebraic equations relating output variables, state variables, and 
control variables may be written as 


!In this book, lower case boldface Roman or Greek letters denote vectors. Capital 
boldface Roman or Greek letters denote matrices. 


y = at) + lhe, bcr Cin(t Xn + du(t)ui Fo t dy At )u; 
Ya = eu(t) + Cady E nes =F Con(t)Xn + dalt), H oc + d. (t)u- 


Ja = Cmi(t)X, + Cat Xp + 750 F Cmn(t Xn + dy (tu, ot + dart Ju 
Rewritten in terms of vector matrix notation 
x = A(t)x + B(r)u 
y = C(t)x + Du 


where 
a(t) aa), s ay (t) 
A(t) as) coc asx(t) 
A(t) = . . . 
iR) asy c dat 
au) bug) ce bu) 
ba(t) Belt) c7 ba(t) 
B(t) = . . . 
ba(t) b.) a b. (t) 
ext) Cit) =e Cya(t) 
Cat) Cub) oe Con(t) 
a rrr Bali 
d,{t) d,(t) vw d,,(t) 
dalt) d,(t) baad dst) 
D(t) = i s 
d) da) ++» dat) 


F 
The elements of A(t) depend on the plant parameters, and the gem E 
B(t) show how each control variable affects the state variables of the p * : 
C(t) shows how the output variables are related to the state varia - 
and D(t) shows how the control variables directly affect the output và 
ables. Here in any practical case n < n. 

We have presented a way to describe system equations in or E 
vectors and matrices. For theoretical work, notational simplicity epe 

i ion 1 enient and is actually essential 1o 

vector matrix operation is most conv 
space analysis of control systems. Such vector matrix representation of the 
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system equation in which x is a state vector is called a state space represen- 
Tation of the system equation. Large, complex problems can be handled 
with ease by the systematic format of the state space representation. 


210 Example 1-| Let us obtain a state Space representation of the 
following linear time-invariant system: 


d"x [mo d 
dr tpe tee + aE + a= (1-11) 
Let us put 
xi eme 
XQ =x 
(n-1) 
XQ —ox 


Equation (1-11) is equivalent to the following n simultaneous equations: 


Zi = Xp 
Xs X; 
Xn-1 = Xn 
E E moe ge 


Hence Eq. (1-11) can be written as the following state Space equation: 


X — Ax 
where 
Xi [ 0 1 0 0 
Xa 0 0 1 0 
EER fo. dim : 
Xn-1 0 0 [9] ey 1 
L Xn _J — “Wyo —Gnag oe —a, 


Existence and uniqueness of solutions of differential equations 
We shall next review existence and uniqueness theorems concerning solutions 
of ordinary. differential equations. Consider a free dynamic system defined 
by the following first-order differential equation: 


x= f(x, 1) 
si = x; 
Let us find the solution of this equation. Geometrically, the solution is 
an integral curve passing through point P;(x,, to). Since the slope of the 
m curve at point Py is (Xo, to), it is possible to draw a short line segment 
rom point P, with the slope f(x, f,). Let us call the end point of the short 
ine segment P,(x,, t). Then P,P, is an approximation to the integral curve 
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in a neighborhood of point P,. Next, draw a short line segment from point 
P, with the slope /(x;,/,). Call the end point of the short line segment 
P.(x,, t2). Repeating this process, we can construct a piecewise linear curve 
P,P,P,...consisting of short line segments, as shown in Fig. 1-5. If we make 
the length of each line segment smaller and smaller by increasing the number 
of line segments, then in the limit P,P,P,... will converge to a curve. 


Figure 1-5 Integral curve, showing solution of x = f(x, t). 


We may intuitively expect that the limiting curve will be the integral curve 
of the given differential equation. In order for this to be true, however, 
there are certain conditions which f(x, t) must satisfy. It is a well-known 
result in theory of ordinary differential equations that such conditions are 
given as the Cauchy-Lipschitz theorem. 


Theorem 1-1 Cauchy-Lipschitz theorem Ina dynamic system 
defined by 


— 


(a) f(x, t) is a single-valued, continuous function in the region T defined by 


Other 


X = ft) Xfi) 29:3; (1-12) 


|x — x| <A, 
and there exists a positive number M such that 


If t)| < M 


in the region T, and 
(b) the following Lipschitz condition 


lft) — fi |< KIx—z 


where points (x, £) and (z, t) are arbitrary points in the region T and 
K is a positive constant, is satisfied. 
Then there exists a unique solution defined in the interval 


; h 
0ct—tuncT, T = min (7, 37) 


and satisfying the initial condition x(t,) = x). The solution is continuous in 
the interval 0 < t — t, < T. The curve P,P,P,... converges to the integral 
curve. 

We shall not give a proof of this theorem here, but it is given in Prob. 
A-1-11. In this theorem, the notation min [r, (4/M)] means that we take the 
minimum of the values of 7 and A/M. If h is co, then the solution x = x(t) 
is unique in the interval 0 < t — t < T =r. 

Note that if f(x, 1?) does not possess a definite value at (x,, fo), then 
the existence of a unique solution passing through this point (xq, tẹ) is not 
guaranteed. In some cases no solution exists; in other cases there exists a 
solution or more than one solution. Such a point is called a singular point. 
Points that are not singular points are called ordinary points. In the absence 
of any forcing function, the trajectory passing through any ordinary point 
is unique. 

Notice that Lipschitz condition is satisfied if /(x, 1) possesses a con- 
tinuous first partial derivative 2 f/óx in the region I’. This can be seen easily 
because if ô f/Óx is continuous in the region I’, there exists a positive constant 
K such that 


Hence 
Ifo — fe 01» | Eas | < xix — zi 


Let us next consider a more general case where an nth-order free dynamic 
system is described by 


X = f(x, t), X(fj) = X, 
or 
Xv OD, 0) t) 
= LOAD) «ss Xt) D) (1-13) 
Xn = fiot)... x) t) 
where xa fo) em xu (Imm 2. 02) 


For this case, the Cauchy-Lipschitz theorem can be stated as follows: 


Theorem 1-2 Suppose that 
(a) the functions fies X2 -ee Xm t) in Eq. (1-13) are single-valued and 
continuous in the region defined by 
|x; — Xiol € hi (1 1, Bis s) 


0<t—h%<T 


and there exist positive numbers M; (i = 1, 2,---> n) such that 


Lis Xo XS t) |< Mi 


in the region T and Me 
(b) the following Lipschitz conditions: 


Ife Xo rss Xu 1) — foem OLS KZ [A — Fl 
G = 1 Fi ea ®) 


where points (Xi, Xs, -+ -> Xn t) and Gs Zines g Zgs t) -n 
points in the region T and K is a positive constant, are satisfied. 
Then there exists a: unique solution defined in the interval 


; h hn 
Gui edQ4 m T= min (s t+ gf.) 


Gi=1, Be as wg W): 


isfyi initial onditions x;(t.) = Xio 
goen na d th here. Instead we refer the 


We shall not give a proof of this theorem 

-1-12. 

reader to Prob. A-1-11 and Prob. A-1 

Suppose that the assumptions made in Theorem 1-2 um be 

obtain a set of unique solutions x,(t), x«t). - - - ,x«(t) of epi ied ie 
every set of the initial conditions Xo, Xeo» <- -> *no- Hence t e 
the system of Eq. (1-13) involves Xio, X20 +- +> *no- By putting 


Xio — Ci (m 1,2,: 7) 


the solution can be written às 
Xi = xy(ei, Co, ...3 Cn; t) 


Xp — X61 C» - -» Cn P) 


Xn = Xalo Co, «5 Cv t) 


: : f 
The solution involving n arbitrary constants is called a general solution o 


the nth-order system. "E 
In the next section we shall discuss ways of choosing a set of $ 


variables for a given system with a given input set. There bs vis d 
the result obtained in this section that in Eq. (1-13) if the p en sp dps 
in Theorem 1-2 are satisfied, then x,(t), x«t), - - ye Xa(t) for Ri p: FG 
can be determined uniquely for the given initial conditions, x;(t,). Xs(f9). 


«cians: Hi. 
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1-3. STATE VARIABLES 


In this section we shall discuss what variables in the system can constitute 
a set of state variables for the system once the input set, an aggregate of all 
the input functions under consideration, is given. The general procedure for 
associating a set of state variables or a state vector with the system is to 
associate with the system a vector X(t) such that x(t) is uniquely determined 
by x(/,) and u(r). In the following discussions we assume that the elements 
of x (1) are linearly independent of each other. 

Based on Theorem 1-2 presented in Section 1-2 we can make the follow- 
ing statement: In the linear or nonlinear system defined by Eq. (1-10) 
rewritten as 


X = f(x, u, r) 


if for any input vector u(/) which belongs to the input set considered for the 
System, the vector function f(x, u, ¢) is a single-valued vector function of 
its arguments and is sufficiently smooth in the time interval O si—1t,«T, 
then the system possesses a unique solution starting at any x(/,) and subjected 
to the given u(t). In such a system x(t) is clearly a state vector associated with 
the system, since x(t) can be uniquely determined for a given x(/,) and any 
element u(t) of the input set. Then n confonen of the vector x, namely 
Ye Xe +++, X, are a set of state variables. The unique solution of this 
system observed at time ¢ after starting at time /, in state x(t) = x, and 
affected by the input vector u(¢) is sometimes written as X(t) = (t; Xp, t). 
This notation clearly indicates that the present state vector X(t) is a func- 
tion of the input vector u(1) and the initial state Xs: 

Note that in the forced dynamic system, if the input vector u(t) exhibits 
à jump at ¢ = t, then the initial state (at 7 = ¢,) becomes ambiguous. To 
avoid this inconvenience we shall assume, unless otherwise stated, that the 
initial state is established at 1 = ‘)— as a point in the state space and the 
input (which may include discontinuities) is applied at ¢ = t,. Therefore, 
if the input is an impulse occurring at t = fj, then the initial state is 
established at ¢ = 14,— and a jump in the state occurs at ¢ = ti: 

Example 1-2. Consider the motion of a rigid body which has one 
point fixed at O. The motion is given by the following set of three first-order dif- 
ferential equations: 

Air, = (B — C)oyo, + ui 

Bo; = (C — A)oo, + Us 

Co; = (A — B)o,o, + Us 
where A, B, and C are the moments of inertia about the principal axes Ox, Oy, 
and Oz, respectively; œ, œ, and œ; are their component angular velocities; wu, us, 


and us, are the known torque inputs about the axes Ox, Oy, and Oz, respectively. 
In this system c, Gs, and o; are a set of state variables. 
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Linear systems Since linear systems are our main Dp 
in this book, we shall pay a particular attention to such systems. Consider 


the linear system defined by 


X = A(1)x + B(t)u (1-14) 
or 
Ky = y(t) xX, + au(t)xs 4- *-- + au(t)xn + pi(1) 
Xy = ay(t)xi + Ayo t)X_ +++ + as(t)xs + Pot) 
Xn = Analt) X + aut), + ++ + Gan(t)Xn + Palt) 
where 


Pit) = by (tu, + by(t)us + +++ + bit )u, 
Plt) = ba(t) + Bool t ue + +++ + ba(t ur (1-15) 


Prt) = ba(t Jur + bnt jus + +++ + bnr(t ur 


The matrices A(t) and B(t) are given, and u, which belongs to a given input 
set, is a known vector for the time interval 0< t—t< T. peers 
P(t), P:(t), -.., pu(t) are known functions of time for the —— 
Ox t — t, < T. Theorem 1-2 states that if the ait) and P) (IJ e i, " 
..., n) are continuous functions of time in the time interval 0 < t— : < 5 
then Eq. (1-14) possesses a unique solution satisfying the initial conditions. 
Hence if the a,,(¢) and balt) Gj 21,2,...,m k= 1, Ds cto) are xc 
tinuous functions of time and the u(t) are known continuous functions o 
time in the interval 0 < t — f£, < T, then x,(t), x(t), ... , Xn(t) are uniquely 
determined by x,(t,), x«(£4), . .. 5 Xn(fo) and u(t), u(t), sasg ns MARCAR 
if this uniqueness holds for any element of the input set, then Kio Kgs 2 ecg 
x, are a set of state variables for the system for the given input y 
Suppose next that the p;(t) are not necessarily continuous in the time 
interval 0 « ¢ — t, < T (where T can be infinity), but may have discontinui- 
ties at ¢ = t, (K —0,1,2,.. ). Then the variables x,, xs, . .., Xn can still be 
a set of state variables for the given input set, if at each — 
t=t,(k =0, 1, 2,...) the initial condition for a continuous time hee 
fk+ı X E fy, can be uniquely determined from the end condition E : 
previous time interval t, « t < t,,, and if this holds true for any e Pune 
u of the given input set. The reason for this is that since x, Xe sie Ka T 
a set of state variables for each continuous time interval x < t< tera 
the amount of jump, if any, of the trajectory at each discontinuity can a 
uniquely determined, then the solution for the time interval 0x a e: n SE 
satisfying the initial conditions Xi (to), 34) eee Xn(to) = an ei 
for a given u. Since Eq. (1-14) is a first-order differential equation, 


amount of jump in the trajectory is finite for the P(t) involving up to the 


impulse function, where the unit impulse functi : 
; g 3 on or th 
is defined by e delta function 


Òli = oo for t=0 
a(t) — 0 for tz 0 
ID O(t) dt = 1 


Example 1-3 Consider the plant shown in Fi 
- £. 1-6(A). It may be 
an electric motor. The differential equation of the plant is given by i oi 


* + (a + D + abt = Ku (1-16) 


K 
s (s+a) (s^ 5b) 


(A) 


— abx, — (GO D) X4 + Ku 


(8) 


Figure 1-6 Block diagrams for a plant. (A) Transfer function representation: (B) state 
Space representation, showing state variables and control function. j 


= third-order differential equation can be transformed into a set of three first- 
order differential equations by defining a set of variables. For example, let us define 


x, =x 
X, =x 
X% =X 


then Eq. (1-16) can be written as 
X, x» 
X, = X, 
X, = —abx, — (a + b)x; + Ku 
In this case, x,, x.,and X, are a set of state variables. if the rlant represents an 


ies motor, then x, denotes the angular position of the output shaft, x, the 
angular velocity of the output shaft, and x; the angular acceleration of the output 


shaft. The control function or the input to the plant, u(t), corresponds to the 
input voltage used to drive the electric motor. Figure 1-6(B) shows a state space 
representation of the plant shown in Fig. 1-6(A). 


Note that, in general, it is desirable that the state variables be quantities 
directly and accurately measurable. This, however, is not necessarily possible 
in a given situation because of the presence of noise. 

In the use of relay control, numerator dynamics in the transfer function 
of a plant would give rise to discontinuities in the output variable and/or its 
derivatives. This problem can be treated by introducing new coordinates to 
handle the discontinuous variables. We shall consider next a case where dis- 
continuities arise in the derivatives of the output. 

If any of the p,(t) of Eq. (1-15) involves a doublet function!, triplet 
function,? etc., then the amount of a jump of the trajectory in the state 
space at time of discontinuity may become infinity and the uniqueness of the 
solution may be lost. Hence, in this case x,, X», ..., x, cannot be a set of 
state variables. In such a case, it is necessary to choose an appropriate set 
of variables as state variables. We proceed to discuss this case. (In Chapter 
4 we present more thorough discussions of such a case.) Consider the plant 
shown in Fig. 1-7. This plant involves numerator dynamics. The differential 
equation for this plant is given by 


(n) (n-1) (m-1) 


(m) : 
x Bax +--+ +ax=bu+bdbu Le db (1-17) 
lL<m<n 


Figure 1-7 Block diagram of plant with numerator dynamics. 


Suppose that the input function u is a discontinuous function of time 1 
possibly involving the delta function, but does not involve any higher-order 
impulse functions, such as a doublet function, triplet function, etc. If we 
define 


Xi x 
Xz x 
=| {=| - (1-18) 
(n-1) 
Xx x 


The unit doublet function is defined as the derivative of the unit impulse function. 
?The unit triplet function is defined as the derivative of the unit doublet function. 
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then Eq. (1-17) becomes 


Ži = Xe 

Xe = He 
. (1-19) 
Xo. Sy 

Xn = —daQX, — ++ — ax, + bin +: e -+ bu 


Notice that in this case if u possesses a finite discontinuity at / = £,, then 
4 will possess the delta function at t = 5, and higher derivatives of u 
possess higher-order impulse functions, such as a doublet function, triplet 
function, etc. at z = 1. If the right-hand side of the last equation in Eq. 
(1-19) involves the doublet function or higher impulse functions, then the 
amount of the jump in the trajectory at the time of discontinuity can become 
infinity. This means that the state x(t) after the discontinuity at ¢ = t 
cannot be determined uniquely by the state just before the discontinuity 
at £ = f,, even though the input u(t) is known. ASrefOFG: 305. Xa, ses in 
defined by Eq. (1-18) cannot be a set of state variables. 

In such a case we can choose a set of n variables consisting of suitable 
combinations of derivatives of various orders of x(t) and u(t) such that the 
resulting trajectories will not exhibit infinite jumps in the new state space. 
(Note that finite jumps in the trajectory do not bother us, provided that we 
can compute the amounts of jumps. The amounts of jumps in the trajectory 
depend on the state space chosen.) 


Example l-4 Suppose that the plant shown in Fig. 1-7 is given by 


X(s) A S + 8s? -- 175 -- 8 
Uls) s$ + 6s? + Tis +6 
or 
V+ 64 + 11 + 6x = ü + 8ü + 174 4-8u (1-20) 
Assume that the input set in which « is an element consists of all functions of 
time ¢ involving up to the delta function, but not higher-order impulse functions, 
such as a doublet function, triplet function, etc. In this case x, X, and € cannot 
constitute a set of state variables. If we put 
x,=x—u 
Xa = X — ú + 6x — 8u 
X; =X — ü + 6% —85 + llx — 17u 
then Eq. (1-20) can be written as 
X; —Ó6x, + xy + 2u 
X_ = —llx, + x; + 6u (1-21) 


X3 = —6x, + 2u 


In the space, whose coordinates are x,, x, and X5, the trajectory of the motion 
starting from any initial condition point can be determined uniquely for a given 
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input « which belongs to the input set considered. Hence, x,, x, and x; are a set 


of state variables. 
Note that by solving Eq. (1-21) we can obtain x, as a function of r. Then 


x = x; + u, the solution to Eq. (1-20), can be determined as a function of t, since 
u is a known function of t. 
We can summarize what we have previously stated in regard to the linear 


system as follows: 
(i) Suppose that in the linear system defined by Eq. (1-14), rewritten as 


x= A(t)x+ B(t)u 
the elements of A(t) are continuous functions of time in the time interval 
0 < t — t, < T. If the elements of B(t)u are integrable (namely, the elements 
To 
of Í B(t)u dt assume definite values) for all u in the input set, then x is 
to 


a state vector for the system for the given input set defined for the time 
interval 0 < ¢ — t, < T. The space having x,, x,,..., x, as the coordinates 
is a state space. 
If, however, any of the elements of B(t)u is not integrable (namely if any 
T+lo 
element of f B(t)u dt becomes infinity or does not possess a unique 
to 


value), then x cannot be a state vector. Hence, if any element of B(t)u 
involves a doublet function, triplet function, etc., or if any element of B(t)u 
is not single-valued, then x cannot be a state vector. 

(ii) In the sys:em defined by 


X -a(t)x +--+ ba(t)x = bt)u + bt)u + --- + b.(tu 


lIzmain 
or 
Xi, = Ky 
Xe = Xs 
Kinny = Xa 
Xn = =AL) — +++ — h)a + Dall) 
where XQ X 
(m) (n-1) 
Fit) = bt) u + b,(t)u +--+ + bn(t)u 


if the forcing function u is not sufficiently smooth that p,(t) involves higher- 
order impulse functions, such as a doublet function, triplet function, etc., 
then x,, X», . - . , Xn cannot be a set of state variables. In such a case a suit- 
able choice of n variables may be made so that the system equation can 
be modified to 

y =C(t)y + D(t)u 


where no derivatives of u appear in the equation. Then y is a state vector for 


the Syst T the in i f me [unct on. nvoiv [0] 
em fo put set which consists [e] ti 


—À E S regm in analog computer simulation of differential 
on system an analog computer is used to Solve a system of dif- 
nals of the integrators constitute a set 


! , in an analog computer simulati 
/ : on of an nth-order 
differential equation System, the output signal of the System is alw 


uniquely determined by the initi iti i 
nitial conditions (which i 
grators) and the input functi ; mo E 
Paul : à E 
"wis abus time). This means that the n initial conditions which are set at 
grators at ¢ = f, are a set of state variables evaluated at t =f 
- 


Hence for t 2 t the outp t g m the n Integra [0] co 
Z to ut SI nals fro i H i 


Consider, for example, the nth-order differential equation 


(n) (1-1) 
X Lax + Ta uxc-ax-u 


By properly choosing scale factors, we can make the coefficients of the 


Figure 1-8 Analog computer diagram for nth-order differential equation X + eee 
Fa Rak ob ux n; i 


differential equation to be equal to or smaller than unity. Therefore, we 
can assume that such a change of time scale is already done and the coef- 
ficients 41, d», ..., a, are equal to or smaller than unity. Figure 1-8 is 
a diagram of analog computer simulation for this mth-order differential 
equation. In this diagram the output signals from the integrators are 
denoted by x,, —X., ..., —x,. Here, Xi Xo, ..., x, are a set of state 


variables. 
As an another example, consider a nonlinear system defined by 


E+ f(xy)+x=0 

where f(x) is a nonlinear function of x. We assume that /(X) is a function 
that can be simulated by means of an analog computer. Figure 1-9 shows 
an analog computer diagram for this system. The output signals from the 
integrators are denoted by x, and —x,. Here, x, and x, are a set of state 
variables. 

Note that what we have -stated here also applies to the differential 
equation system defined by Eq. (1-17). We shall give a detailed discussion 
of an analog computer simulation of the system of Eq. (1-17) in Chapter 4. 


Function 
Generator 


Figure 1-9 Analog computer diagram for system defined by X + f(x) + x — 0. 


Nonuniqueness of set of state variables A set of state variables 
can be associated with a given system in many ways. In other words there 
exist a number of different sets of state variables for a given system for a 
given input set. Which of these is most relevant depends on the individual 
situation, that is, the nature of the problem, the nature of the input set, 
etc. If, however, it is desired to construct physically an optimal controller 
synthesized by optimal control theory, then it is necessary to choose measur- 
able signals as a set of state variables because actual measurement of the 
state variables is required for realization of an optimal control law. 
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Suppose that x,, Xs, ..-, Xn are a set of state variables. We may take 
as another set of state variables any set of functions 


XQ— Mi Sys Kgs sate y Xx) 


AA 
Xy Was ues Na) 


^ 


acm Mea oreo Rem.) 


provided that for every set of values X,, Ñ», ..., X, there corresponds a 
unique set of values x;, x4, ... , Xn, and vice versa. ü 

As just stated, a variety of sets of state variables can be obtained b 
changing the coordinate system in the state space. Changing the set ot 
cdi variables will cause changes in the state space equation of the system. 
Hence the selection of a set of state variables affects greatly the analytic and 
computational effort required. 
Wc ds br cose a particular set of state variables, though they may 
iie y easurable, may simplify greatly the complexity of state 
spac representation of the system. Simplification of the state space equatio 
is discussed in great detail in Chapters 3 and 4. ELS 


1-4 OPTIMAL CONTROL SYSTEMS 


= ne igs control theory, attention was focused on the system's absolute 
a ility, transient performance, such as rise time, settling time, maximum 
overshoot, etc., and steady state accuracy. 
WE demands for control systems of increasingly high performance 
- : bs increasingly stringent requirements on specifications of transient 
: steady state responses. Synthesis of control systems which meet stringent 
— on specifications by classical control theory is, in fact, an 
3 Pme: difficult job. In particular, if the performance requirements 
se "s : need of a time-varying control law, classical control theory is 
ies : eid Merge uai For such problems, synthesis methods based on 

imal control theory offer distinct advanta i i 
anon ges over classical synthesis 
- In designing a high-performance system, we strive for an optimal con- 
Ld emet eges to the many and diverse engineering constraints which 
i oe cep dei over-all configuration of the system evolves from 
a consideration of various factors, such as perf i 

consic E ormance, efficien 
reliability, stability, etc. mw 
" im section deals with the basic problem of optimal control systems. 
OP imal control of a plant is finding the present control decision, subject 

certain constraints, that will minimize some measure of deviation from 
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the ideal performance. To make the problem meaningful, we must define 
a quantitative index of performance precisely. This is one of the most im- 
portant aspects of the optimal control problem. Note that the problem of 
defining a performance index is closely related to the problem of deriving a 
mathematical model of a given plant. 

In defining a reasonable performance index, physical intuition plays an 
important role. Some performance indexes lead to problems which are 
mathematically tractable; others lead to problems which cannot be solved 
even by use of digital computers. An example of many performance indexes 
which are mathematically tractable if used with linear systems is the gener- 
alized quadratic performance index, such as the integrated squared error 
between the actual state and the desired state during a fixed time interval. 
Constraints may be on the magnitude of the control vector, on the available 
control energy, or on both. 

Once the performance index is defined, the best possible system is that 
which minimizes this index subject to the constraints imposed on the system. 
Many investigations in the literature have been concerned with (a) optimal 
control of linear systems with various quadratic performance indexes, sub- 
ject to constraints on the control vector and/or the state vector, (b) time 
optimal contro’ of linear systems, subject to constraints imposed on the 
control vector. 

Problems which have been treated by optimal control theory may be 
formulated as follows: 

1. A plant to be controlled can be adequately represented by a mathe- 

matical model of the form 


x = f(x, u, t) 
or its discrete-time equivalence. 
2. The desired (or ideal) state of the plant or the desired (or ideal) 


output of the system is given. 
3. A performance index is given. It has the form of 


j= [ Ze u, £) dt 
to 


or its equivalent discrete-time expression. The integrand L(x, u, t) 
in the integral expression is called the loss function and represents a 
measure of the instantaneous deviation from the desired (or ideal) 


performance. 
4. Constraints on the control vector u and/or the state vector x of the 


plant are given. 


Once the optimal control problem is mathematically formulated a variety 
of techniques for the solution is available, including calculus of variations, 


donani s TM . 
- —€— age cpt Pontriagin’s maximum principle, method of steepest 
> etc. We shall not present various optimal i 
ques dee i puma! control problems in this 
: , all treat in Chapter 9 o i 
ne particular but basi 

a i - sic type of 

2 : I control problem; namely, optimal control of linear uerb uh 
quadratic or Hermitian performance indexes 


menting the optimal controller 
rese 
For W y 
s linear plants ith a mathematical] tractable performance index 
ve may be able to obtain analytically the optimal contr ol law. The optimal 
control law may however b y e 
.,1n the sens that 1t 1s 
time-var ying and/or nonlinear with extreme complexity, ete; The feasibility 


is not great, the latter may be preferable 
N * B . 3 B 
ote that, besides economic considerations, application of optimal con- 


I : wem 
Birt eue iei cera tnat investigations based on optimal 
will greatly help in understandin i i 
1 g and developing theor 
Which eventually will lead to practical systems close to ene ick 


systems may possibly include computers for the generation of control 
signals. Design of such systems cannot possibly be accomplished by classical 


control theory. 


1-5 CLASSICAL CONTROL THEORY VERSUS 
MODERN CONTROL THEORY 


In concluding this chapter we shall compare classical control theory and 
modern control theory from various viewpoints. 


1. Classical control theory is best suited for design of single-input 
single-output linear time-invariant systems. The design is based on the 
transient response and frequency response characteristics of the system. 
In classical control theory severe limitations and difficulties exist for design 
of multivariable control systems and time-varying control systems. Design 
by modern control theory based on state space concepts can be carried out 
for a class of inputs instead of the impulse function, step function, or 
sinusoidal function. Modern control theory can be applied to the design of 
linear multivariable control systems and linear time-varying control systems 
that are optimal with given performance indexes. á 


2. Experimentally, a complex system can be characterized easily in 
terms of frequency response curves. Frequency response tests are, in general, 
simple and can be made accurate by use of accurate sinusoidal signal 
generators and precise sinusoidal measurement equipments. It is usually 
difficult to write a.set of differential equations representing the dynamics of 
a complex system so that modern control theory can be applied directly. 
Hence, from this viewpoint, classical control theory is more convenient than 
modern control theory. 


3. In classical control theory, algebraic manipulation of the parameters 
can be made before numerical values are substituted. This is not the case 
in modern control theory because of its extensive use of vector matrix nota- 
tion. Vector matrix notation greatly simplifies description of system equa- 
tions. Notational and conceptual simplicity of vector matrix operations is 
essential for theoretical work. Although the analytical solution in vector 
matrix notation may seem simple, if the numerical solution is required, 
computation becomes extremely involved. Except in simple cases, we must 
resort to digital computers for necessary computations. (In obtaining 
computer solutions we must give the numerical values for all parameters.) 


4. Inclusion of the initial conditions in the system design is impossible in 
classical control theory. Modern control theory, on the other hand, enables 
us to include the system's initial conditions in the design. 
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5. For high-frequency noise rejection in control systems, classical fre- 
quency domain design techniques are more convenient than state space 
techniques. 


From the brief comparisons just made, we can conclude that although 
modern control theory is more general than classical control theory, the 
latter will not be discarded unless the former becomes, in future, better 
in every respect of control system analysis and synthesis. 


Outline of text In this book we shall be concerned only 
with deterministic systems. The system is called deterministic if the future 
states are completely determined by the present state provided that the 
future inputs are known. 

The contents of the text are as follows: Chapter 1 presents an intro- 
duction to state space analysis of control systems. Chapter 2 reviews intro- 
ductory material on matrices and vectors. Chapter 3 discusses reduction of 
square matrices to the Jordan canonical form and coordinate transformation 
in state spaces. Chapter 4 deals with state space representation of control 
systems. Chapter 5 presents additional materials on matrix analysis which 
are needed for studying Chapters 6, 7, 8, and 9. Chapter 6 gives formal 
solutions of state space equations. Chapter 7 presents detailed discussions 
of controllability and observability of control systems. Chapter 8 treats 
stability analyses based on the second method of Liapunov and on Routh- 
Hurwitz stability criteria. Some applications of the second method of 
Liapunov to design problems are included. Finally, Chapter 9 discusses 
analytical design of optimal control systems based on quadratic perform- 
ance indexes. 

Throughout the book an attempt has been made to present basic mathe- 
matical techniques necessary for understanding modern control theory in 
the simplest and easiest way possible. A number of simple examples are 
provided at the end of each chapter as Exercise Problems and Solutions. 
Those who are as yet unfamiliar with modern control theory will find these 
examples useful for understanding the materials presented in the book and 
for learning techniques for analyzing problems in modern control theory. 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-I-1 Hamilton showed that the first variation of the time 
integral of the difference between the kinetic energy T and the potential energy 
V of a dynamic system is zero, or 


ts 
afi T- V)dt=0 
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This is known as Hamiltons principle. Derive Hamilton's densis yw veg 
bert's principle, which states that the applied force, together wit > e for sia 
inertia, forms a system in equilibrium. For a system consisting of n par 3 


d'Alembert's principle can be stated analytically as 
n 2 d? Vi dz, - i av 
> Lm. € -— x) 8x, + (m. aa Y) 8y, + (m. di? Z) 82; | 
i (1-22) 


Solution Since 


E X [m (T 8x, + ae òy: + i à) | 
~ 405 e] +) (5) 


+ p? (X: 8x; ue Y, òy: a+ Zi 8z,) (1-23) 


=l 


LS ml (Se) EN + BY) 
LEG) +a +F | 
e system. If there is a potential function we have 


Here, 


is the kinetic energy T of th 
X (X, x, + Y 8 + Z 82) = —8V 
t 


Integrating Eq. (1-23) with respect to t from 1, to fs, 

c dx; dy; dz; )T = É T — 8V)dt 

PIE EC 8x, Tun + d Oz, ^ ,9 ) 
If the positions of the particles of the system are fixed att = f; and i = - I 
variations 8x,, 5y;, 8z, vanish for t = f and t =f». The left-hand side o e 
preceding equation then vanishes. Hence 


T" D 
èf E V) dt 


This is Hamilton's principle. | 
Problem A-I-2 Suppose that there are r masses whose coordinates 
are (xi, Xe, X3), (Xas Xs Xo), < -o (Grot Xar-v x4,). Suppose also that p a. 
on r masses are (Xj, Xs, X3), (Xs, Xs, X), oca Aaron Mees X57). Newton's 
equations of motion are given by 
mj, = X, (9 1, 2,2. 37) 


< 3r 
where m, = mg = mii... Msr-2 = Msr-1 = Msr: If there are s (where s < 3r) 


constraint equations of the form 


Xi = xií(qd,. d» -+ -s In) ( ee V) z (xr 2 la = 
where n = 3r — s, then show that the kinetic energy of the system can be a function J 6 Mel 24i dt odi 
of both the 4; and q; (i = 1, 2,..., n). Since the preceding equation must hold for any 8q;, the coefficient of every òq: 
Solution Notice that must vanish, or 
gas ar—V)_ 4 (XT) = 9 G= hra 
UC og $a; dt\ od: i 
Hence, If we substitute L for T — V in the foregoing equation 
3r ar n m 
Tav Ud LSU EX; OX: 5 4 (2L) - 2% — o AEE WT 
E 2* > 2 EZ 09; 64x agak dt \ôġi ôqi ( 
i E ) X (X mi 2u x) jk These are Lagrange’s equations of motion. . . 
jai k=l Mz1 95 ĉr Problem A-1-4 Derive Hamilton’s equations of motion 
1 n n , 
— 722 Qd jd. a=, p= —?2H (i 29 1,2; 91) 
where Pi di 
Ox; 0x where H = T + V = Hamiltonian 
Qi = D m 3q; qu from Hamilton’s principle i 
Note that the Q;, are constants or functions of q; (i = 1, 2,..., n). Hence T can a f ut dms 
TEUER D where L =T — V = Lagrangian 
Problem A-1-3 Derive Lagrange's equations of motion ; The variables p; are defined by 
oT ; 
Ad(0LY 2L . ; gsm. lieb. 
from Hamilton's principle Solution Since the Hamiltonian function H is defined by 
t: H = T+ V 
è [La =0 m 
i the Lagrangian function can be written as 
where L =T — V = Lagrangian taf. v=2T-— 28 
Solution The kinetic energy T is a quadratic function of g;, d», ..., Here T and H are functions of q; and p;. Since T can be written as 


qn and can be written as 
n n 
T= a Y È 4:54:49; (aij = 5) 


i=l j=l 


ID X "P 
EO E " Q;.d;d. 


i / a;; are constants or are functions of q,, ds - ++» Fn» WE obtain 
where the Q,, are constants or functions of q; (i = 1,2, . . . , n). The potential energy where the aj; 
vil Function. ouly ekg, aud nota function otg. Erot Hamilton's principle; p, e 2T. aids + audi nds 
we obtain 92: 
X (à [e x SET ag ] it —0 P» =o = andi + dads coo + Amnn 
nl de 57 mu Ser 7 
By integrating the second term by parts, noting that - 
Dra 3; 7 anid E anoa + tob annin 
8j, = s% = as Gn 
a” We see that 


and that the 5g, vanish at ¢ = 4, and t = f, we obtain f im Ea Ódi z4 
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Hamilton's principle then becomes 
fts tayn 
af ar-ia-s[*( EERE ) 
tí t È 26, 1 H jae 


-5 f? (È på- H)a 


ti 
tla n 
= n » (à. òp: + p, 84; — an p. = Fan ba.) dt =0 
Noting that 


t=t: 


tz 
fin 9d; dt = p, $a. 


and ôq; vanish at f = ¢, and t = tə», we obtain 


f; (s - 28) on. — (a. 25.) 8a, ] t = o (1-24) 


fr isl 


ts E 
= att òq; dt 


t-t 


Since V does not involve Pi 


aH _ eT 

On p 

It can be shown that T ú 
ar _, 

op, “s 


We will demonstrate this for a simple case where 
T= langi + 24124142 + 095) 
By definition, p, and P» are obtained as 


Pi = 5 = 4119: + aĝi 


D» = Od = did; + Oo» Qs 
Solving for g, and d», We obtain 


= Imp) + anp: 


—dmPi — pe 
41122 — à, 


à . 

a 910» — aj,” ” 

N isa x : a 

TRE 411422 — aj, > O, since T IS a positive definite quantity. (Sec p. 79.) 
etic energy then can be written in terms of p, and p, as follows: 


1 
VERE ERROR = 
Xa,.as — al) (ase P? — 2ajspips + 413) 
By differentiating T with Tespect to p, and p,, respectively, we obtain 


oT = S22P1 — GaP _ 


Op 411055 — at, i 


OT _ =a sp, + anp: — 
Ops uds — ah 1s 
or 


ÉL S 
ap, 4 (i = 1, 2) 


M . : : 
Ve can easily extend this result for i = 1,2, .... n. It is simpler. however ta nce 
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matrix algebra to show this for a general case where i = 1,2, ..., n. (Sec Prob. 
A-5-13). We thus obtain 
LT _ 2H 
S Opi ER Op, 


The coefficients of the 8p, in Eq. (1-24) vanish. Hence Eq. (1-24) becomes 
té] e. fs 2H) ] 
; +57 ) 84; | dt =0 
b (4, t 3a. 4 
Since the ðq; are arbitrary, their coefficients must be equal to zero, or 


. , OH : 
Brg, =o fie 1D, ast) 


We have thus derived Hamilton’s equations of motion: 


eee uu 
4 = Fp pi = Iq; (122.1, 2, E) 


Problem A-1-5 Figure 1-10 shows a double pendulum. Obtain the 
Hamiltonian function H for this systems. Assume that the pendulum swings in 
one plane. 


Figure 1-10. Double pendulum. 


Solution Let us define generalized coordinates q, and q, by 
qı =, ds = 6, 
In this system the mass m, has risen from its equilibrium position a distance 
1,1 — cos q;) and the mass m, has risen from its equilibrium position a distance 
1,1 — cosq,) + L(1 — cos q). The potential energy is therefore given by 
V = mygl(1— cosq;) + mg[l(1 — cos gy} + (1 — cos q?)] 

Here we have regarded q, = 0 and q, = 0 as the zero potential energy datum. The 
x and y coordinates of the displacement (x,, yı) of the mass m, are given by 


x, =l sing, yı = —l, cos q, 


and those of the mass m, are given by 


x. = l, sina. + h sin as. Yə = —/, cos q, — Í, cos qs 


Hence, the square of the velocity of the mass m, is given by 
vi = ži H = (hå) 
and the square of the velocity of the mass m, is given by 
w= HTH 
= (hå)? + (ds) + 2lhlsdids cos (q, — Ge) 
Therefore, the kinetic energy is given by 


m; 5 23 pl. xix 2 29 
T= (3 iic 2 n)at Tomjllsdids COS (q, — q2) + e n) 


1 E we " 
= (an? + 26129142 + 42293) 


where tja ES png m 
Qj. = mall, cos (qi n q) 
a» = ml} 


The generalized momenta are 


Thus 
Pi = n = 4nd do _ Pr = 2 
oq 91 my’ ds =m, 
D» = ay = isa T aga j 1 n E (2) 
od | T=5 (2) — 5 m. m, 


From Prob. A-1-4, T can be written in terms of p, and p, as follows: | . 
i The Hamiltonian function for this system is 


T = ys (nn p? — 2a piP: + anp?) 
5 PEN 221 12P1P» 1125 2 2 R 
asi ass — aia) = Pi Edge kla — a 
Hence the Hamiltonian function is XE M q 
H=T+V Hamilton’s equations of motion are 
— mol3pi — 2msllpyps cos (qı — qe) + (mi + m)lips ONCE eH _ Pr 
mi Blm, +m, — ms cos? (a, — 4:)] f= 3p; m 
+ Gn, + m)gl(1 — cosq;) + mgla(1 — cos ge) — 9H _ p: 
Problem A-1-6 Obtain the potential energy and kinetic energy of ] Wi. M; 
the system shown in Fig. 1-11. Then, derive Hamilton's equations of motion. A= 2 = —kiq, + klg: — qı) 
Solution Let us define generalized coordinates q, and q, as follows: Ee 
n=, q2 = Xs D: = 7 $a. = —kiq: — 4) 


Titdimetitenengyef the: systems which are equivalent to the following familiar equations 


mj, Kx kn — x;j-0 
Maža + ks(xo — x) —0 


Problem A-1-7 Consider the mechanical vibra- 


T= imd T imd; 
The potential energy of the system is 
= git klg — qi) 


Oe ee eee c e er | tory system shown in Fig. 1-12. By defining a set of state 
equilibrium position.) The generalized momenta are defined by | variables, obtain a state space equation of the system. i E 
| Es . . igure - 
es e = m4; Solution The equations of motion are given by Nechanic 
or my, + ky, — ks — y) = 9 vibratory 
a. der | mj, + kQs — Y) + ky: = 0 system. 


36 INTRODUCTION 


By defining 
: X, = Y Xs = Jy Xs = ys x= 
we obtain : "TE 
Xp = Xo 
2k k 
X. = — — — 
2 m xp + p X; 
Xs = X, 
2k 
X, = — — — 
Um ES 


In this system x,, x. : 
1» Xs, X3, and x, are a set of state variables. Usi s 
i : : ng vector 
notation we obtain s or matrix 


x= Ax 
where 
[ x 0 14 6 64 
Xs -2E 0 ik: 0 
x= : ime 1 m 
| Xs 0 0 0 1 
k 2k 
"ij m x m 
P a : x 
— € roblem A-1-8 Obtain a state space equation of the system shown 


Solution The equation relating x and u is 


X+xtat)x+x=—y 
Define x, = x, and Xo 


= X. Then, a state space i 
eed p representation of the system 


Žž% =X 


ž = —x, — a(t)x, — x? + u 


2 Time Varying 


Figure 1-13 Nonlinear system. 
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Problem A-1-9 (1-3) Derive 3 8, 
the equation of the three axes reaction-wheel 
system shown in Fig. !-14. Assume that the 
reaction-whee! spin axes are aligned with 
principal inertia axes and that the effect of 
small misalignments between these two sets 
of axes can be neglected. Assume also that : 
the inertial ellipsoid of the spacecraft is time 6, 
invariant. 

In the figure, the three axes shown are rigid 
spacecraft axes. In order to derive the equa- Figure 1-14 Three axes reaction- 
tion, define wheel system. 


6, = angular velocity about the ith axis (that is, the quantity 
measured by a rate gyro about the ith axis) 
T; — external torque about the ith axis 
H; = total angular momentum component along the ith axis 
=h d L6, + 8i 
where h; = angular momentum of reaction wheel about the ith axis 
LÖ; = angular momentum of spacecraft body about the ith axis 
g; = ith angular momentum component of rotating parts 
within spacecraft (such as rate gyros) excluding /j; 


Solution 
Ti H, LA — Heb; 
T,| —^|H,| + | Hô, — H6, 
Ts H, rn — Hif, 
[h 16, & 
= f + | n6,| + | 
lis IP, és 


[Uis + 236. — (he +8)0] [ (la — 1): | 
T] On de £6, — (hy + £36 33 |o = 1,)0,0s 
(hy + 82)6, — (n + £)6- (i — 1,0,6. J 


Problem A-I-I0 Figure 1-15 shows a multiple-input multiple-output 
system. Obtain a state space equation of this system. 


Solution 
PN —1 ind | 10 bd 
- = 
[a] l 4 _7| hs —-5 5 te | 
Problem A-1-11 Consider the ordinary differential equation of the 


first order 
X(t) = flt), t) (1-25) 


!Sce list of references at the end of this chapter. - 


Figure 1-15 Multiple-input multiple-output system. 


where X(t) = Xo 

Prove that if f(x(s), ) is single-valued, continuous, and bounded in the region T 

defined by |x — x| < A, 0 Ct — t, <7 and if the following Lipschitz condition: 
fes t) — f 0| € K|x — z| 


where points (x, t) and (z, t) are arbitrary points in the region T and K is a positive 
constant, is satisfied, then the given differential equation has a solution satisfying 


the initial condition x(t) = x). Prove also that the solution x(t) defined in the 
interval 


Og t—t,«7T, T= min(7, 7.) 


where M is the maximum value of |/(x, 1) |, and satisfying the initial condition is 
unique. 


Solution We shall first prove the existence of the solution. Define 


x(t), x(0),..., x4, ... in the interval 0 € £ — t, < T as follows: 
G) xolt) = Xo 
x 
(ii) Xysilt) = xo + |, fest), 0 d (1-26) 


We must prove that such definition is possible. In order for x,,, to be defined by 

Eq. (1-26) it is necessary to show that f(x,(t), t) can be defined in the interval 

0 « t — t, <T and is continuous. To show this we need prove only that 
|Xn(t) — xo| < A, O<t— Hex TF (1-27) 


For n = 0, inequality (1-27) is clearly satisfied. Suppose that | x,-,(t) — x)|< A. 
Then by assumption 


fart), 01 <M 
Then from Eq. (1-26) 


xt) — xol = |, fes D dt | MG = 09 


On the other hand 
h 
Oo<t—%<T< M 
Hence we have proved that 
|xn(t) — x < M(r—t)«h 


Thus we have shown that the function series x(t), xf), - +> x,(t),... can be 
defined as given by Eq. (1-26). Note that each x(t) is continuous. 

We shall next prove that the function series converges to à function as 7 ap- 
proaches infinity. To prove this we need to show that 
ENGA — (—0,1,2,...) (1-28) 


n! 


| xai) P xn(t)| < 


where N is a positive number. We shall prove this by mathematical induction. 
Inequality (1-28) clearly holds for n = 0, since x, and x, are continuous there 
exists a positive number N such that |xi(£) — x4] = N. Suppose that inequality 
(1-28) is true for n — 1, then 


(Xart) — x1 = (x P [fest t) dt) - (x. je [fes 0, t) dr) 
=| fi [V D -Samt Z2 


«|f, lx — xaO a| 


IN 


a d DE 
= ef er + 


_ K"N(t — to)” 
E n! 
Thus, we have proved inequality (1-28). Hence, 
Xa = Xo + (X1 — Xo) + (x2 — X1) Fee + (Xa — Xn-1) 
n-1 
xy X Gui — Xx) 
k=0 


converges uniformly. Therefore 
x(t) = lim x(t) 
Sars 


exists and is continuous. By letting n — co in Eq. (1-26), 


t x X 
x(t) = xo + | fo. 0 4t, Oļ[lt—t <T (1-29) 
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For Oc t — t, < T, we can differentiate Eq. (1-29) with respect to z, or X(t) 
= f(x(), t). Clearly, x(t) = x). This proves the existence of the solution satisfying 
the initial condition. 

We shall next prove uniqueness of the solution. Suppose that X(¢) is another 
solution of Eq. (1-25). Then 


R(t) = x, + fÉ S (R(t), t) dt 


Since x(t) — X(t) is continuous in the interval 0 X t — t, « T, it has a maximum, 
G. Then we can prove that the following inequality holds true: 


Ix) - 30] Kc — 9 — 91,2...) (1-30) 


To prove this, we shall use mathematical induction. Clearly this holds true for 
n = 0. Suppose that it is true for n — 1. Then 


ix) — 201 = f. Dre. 0 - 78. 0] | 
«|f, kixo - 2C) ar | 


E KEG — t,)"-! 
«|f, re a= di 
= kg Ct» 

n! 


This completes the proof of inequality (1-30). By letting n — co, lxt) — X(t)| 
— 0. Hence 

x(t) = R(t) 
This proves uniqueness of the solution. 


Problem A-I-12 (Consider the following simultancous first-order 
differential equations: 


ži = f(X Xo, t : 
h AiG xs D) ndn 
Xs = fo(X1, Xo, t) 

where X(t) = xi, Xo(fo) = Xz 
Assume that fi(xi, xs, t) and fX, Xs, t) are single-valued and continuous in the 
region T defined by 

Il; —xel€4^ |n —x,|«h  0O€t—-t«« 
and |f| € M, || € M in the region T. Assume also that f, and f» satisfy the 
following Lipschitz conditions: 

fil Xa t) — fi(zis Za DIL Kx; — z| 4 |x, — Zal) 

LG, xs, t) — faZi zs, t)| < K(x, — z| + lx: — Zal) 
where points (x,, x», ¢) and (zi, zo t) are arbitrary points in the region T and K is 
a positive constant. 

Prove that there exists a unique solution of the differential equation system of 


INTRODUCTION Al 


Eq. (1-31) satisfying the given initial condition and defined in the interval 
P h 
0ct—t,«T, T = min(z, 4) 


Solution The proof essentially parallels that for Prob. A-1-11. 
Therefore, we shall give only the outline of the proof. 
To prove existence of the solution define 


Xie) = 2 + f fien, xa), 1) dt 


Xo naa(£) = Xo + EEO) X2,n(t), i) dt 
(i0, 1, 2,. ...) 


(A) 


Function 


Generator 


-K sgn x-ax 


(8) 


Figure 1-16 On-off control system. (A) Block diagram; (B) analog computer diagram. 


where Xyo(f) = X10, Xa (f) = Xap 


and show convergence of the series, obtain x;,,(¢) and xe,,(f) and then let n ap- 
proach infinity. This will result in 


x(t) = xiy + | Si(xi(t), x(t), t) dt 


x«t) = xa + [^ Alle), xs), 0) dt 


To prove uniqueness of the solution, assume X,(t) and %.(t) as another set 
of solution of Eq. (1-31). Then prove that X,(f) — x,(t), and X(t) — x(t) as 
H —» co. 

The proof can, of course, be extended to the system of n first-order differential 
equations. 


Problem A-1-13 Obtain a state space equation of the on-off control 
system shown in Fig. 1-16(A). Assume that we are interested in the response of 
the system subjected to the initial condition only. 


Solution The differential equation of the plant is 


X + ax = Ku (1-32) 
where u = — sgn x 
By defining 
xX; =x, Xp =X 
Eq. (1-32) can be written as 
aioe (1-33 
ž = —K sgn x, — ax, icm: 


Notice that —K sgn x, — ax, is a piecewise continuous function having discon- 
tinuities whenever x, changes its sign. Since —K sgn x, — ax; involves only dis- 
continuities of the magnitude 2K, Eq. (1-33) possesses a unique solution satisfying 
the given initial condition. Hence x, and x, are a set of state variables and Eq. 
(1-33) is a state space representation of the system. 

Figure 1-16(B) shows an analog computer diagram for this system. Observe 
that the output signals from the integrators can be a set of state variables. 


PROBLEMS 


Problem B-1-1 A particle of mass m moves under the gravitational 
field. The gravitational force is defined by the following potential function: 


V= —m— 


where r is the distance from the origin of the gravitational field and & is a con- 
stant. By using polar coordinates r, 6, obtain the kinetic energy T of the particle. 
Then define 

n=", qs = 9 


oT oT 
pi = 24, Pe Odo 
and derive Hamilton’s equations of motion of the particle. 


Problem B-1-2 Consider the system shown in Fig. 
1-17. By defining a suitable set of state variables obtain a state 


space equation of the system. 
Problem B-1-3 Consider the following system: 
x +f + g(x) =0 
Define P 
F(x) = [ * fo) de 
and x, =X, Xs — X + F(x). Show that the given system can be 


written in the following form: 
x = f(x) 


where 


r Figure 1-17 
f(x) = [e zi e iul Fo) Mechanical 
pA 620p 29) L —g(x) J vibratory 
system. 


Problem B-1-4 Derive a state space equation of 
the system shown in Fig. 1-18. Here, xi and x, are a set of state 
variables and u, and u, are control variables. 


stem. 


Figure 1-18 Multiple-input multiple-output Sy 
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f Problem B-I-5 Figure 1-19 shows a simple hydraulic system. 
Derive a state space equation of the system. Take 


x(t) = fest = be 
X(t) X(t) 
as a state vector. In this system u(/) is the control function. Assume that oil leakage, 


compressibility including the effects of dissolved air, expansion of pipelines, etc. 
are negligible. 


Power 
Cylinder 


Power Piston TOTO 


Figure 1-19 Hydraulic system. 


Note that if we define 


S = power piston area in.? 
Q — Sx — rate of flow of fluid to the power cylinder, in.?/sec 
AP — pressure difference across the power piston, psi 
= displacement of pilot valve, in. (control function) 
then Q can be given by 
Q = Ku — K, AP 
where K, and K, are constants. 
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Review of | 
Matrices and Vectors 


2-1 INTRODUCTION 


The principal objective of this chapter is to review elementary vector matrix 
analysis.! Section 2-1 presents mainly definitions of matrices and associated 
terminologies. Section 2-2 gives some of the rules of matrix operations. 
Section 2-3 discusses briefly vectors and vector spaces. Finally Section 2-4 
presents definitions of terminologies commonly appearing in matrix analysis. 
Discussions of matrix polynominals, Cayley-Hamilton theorem, the minimal 
polynomial, the exponential function of a matrix, etc. are postponed to 
Chapter 5. 
We now give definitions of matrices and associated terminologies: 


Number fields A set F of numbers containing at least two 
numbers is a field if a, b are any two members of F, then a + b, a — b, ab, 
a/b (where b = 0) are also members of F. For example, the set of all real 
numbers constitutes a field, and so does that of all complex numbers. The 
set of integers, however, is not a field, since a/b (where b + 0) may not be 
a member of F. Similarly, neither the set of positive numbers nor the set of 
negative numbers is a field. 


!Readers interested in details of vector matrix analysis refer to specialized works, for 
example (2-7, 2-2, 2-3, 2-4 among many others). The figures in parentheses refer to books 
listed at the end of this chapter. 


In this book the fields that concern us are the real field (the set of all 
real numbers) and the complex field (the set of all complex numbers.) 


Matrix A matrix A over a field F is a rectangular array of 
numbers in F. If A is over the field of real numbers, A is called a real matrix. 
If A is over the field of complex numbers, A is called a complex matrix. 

In this book we may not mention each time the field to which the ele- 
ments of the matrix belong, but we shall assume that such a field has been 
given. 

in a matrix the number of columns, in general, is not necessarily the 
same as that of rows. Consider the following matrix: 


[4n da 7 01 
a21 Aaa *** om 

A=]. $ $ (2-1) 
Ani Ana iic Anm 


The number standing in the ith row and jth column is denoted by a;;. The 
numbers a; are called the elements of A. This matrix has n rows and m 
columns and is called an n x m matrix. The first suffix represents the row 
number and the second suffix the column number. The matrix A given by 
Eq. (2-1) is sometimes written as (a;;), where we indicate only the (/, /)th 
element. Two matrices are called equal if and only if their corresponding 
elements are equal. 

Matrices are closely related to linear transformations. Consider a system 
of n linear equations 


Gy XQ isXo b+ + AymXm = 2, 

Qoi X F os Xo + +++ F am Xm = Z: 

AnyX, + AneXe + $ee + AnmXm = Za 
This set of n equations assigns a unique set of n variables z,, Z,,..., Zn to 
every set of m variables x,, X», ... , x4. Such an assignment is called a /inear 
transformation of the set of variables x,, x,, ... , Xm into the set of variables 
Zj Zo, +--+, Za. This linear transformation is characterized by the array of 


nm coefficients a; as given by Eq. (2-1). Hence A = (a;;) is called the 
matrix of the linear transformation. 


Square matrix A square matrix is a matrix in which the 
number of rows is equal to that of columns. Such a square matrix is called 
a matrix of order n, where n is the number of rows or columns. it is com- 
monly called an n x n matrix. 


Trace  Thetrace of ann x n matrix A is defined by 


trace of A = tr À = x dà 


Diagonal matrix Then X n matrix having the elements a,;, 


yo, ... , Ann in its main diagonal and zeros in all other positions is called 
a diagonal matrix and is written as 
ay, 0 
Aag 
A= - = (4:59) 
0 Ann 


where the 8,; are the Kronecker delta defined by 
s=1 (i=j) 
=0 (Qj) 
(Note that all the terms not explicitly written in the matrix are zero.) The 
diagonal matrix is also denoted by 
diag (4,,, ds», ---» Ann) 
These notations are often extended to cover submatrices. For example, if 


A Aya Q33 34 
A = > = 
As, Are As3 gs 
then the matrix i 
au xis 0 0 
I 
dn awto 0 
eamm, UE SC 
0 O ! 433 as. 
i 
0 O )44 Ags 


may be denoted by 
A 0 
l B 
A matrix whose elements are divided into submatrices by vertical and 
horizontal dotted lines is called a partitioned matrix. We shall give a detailed 
discussion of partitioned matrices later. 


| or  diüg(A, B) 


Identity matrix or unity matrix The identity matrix or unity 
matrix I is a matrix whose elements on the main diagonal are equal to unity 
and all other elements are equal to zero, 
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I 0 «e 6 
ti 4 vt? = diag (1,1, ... , 1) 
» d ud 


The identity matrix I is sometimes written as I = (6;;) where the 8;; are the 
Kronecker delta. If it is necessary to indicate the number of rows (or col- 
umns) of the identity matrix we shall put the suffix corresponding to the 
number of rows (or columns). For example, an n X n identity matrix will 
be denoted by Iņ. 

The identity matrix I is uniquely determined by the relation AI = IA = A 
for all square matrices A. Therefore, if AB = BA for all B, then A is a scalar 
multiple of I. (For multiplication of a matrix by a matrix, see Section 2-2.) 


Scalar matrix A diagonal matrix A whose elements are equal 
to one another is called a scalar matrix. 


Triangular matrix A matrix whose elements below (or above) 
the main diagonal are all zero is called a triangular matrix. Examples are 


s jg dis dg du Diu. “6 
0 G29 Ass Ans ds de O 0 
0 0 fis d ds Gx, dy O 
0 0 0 Asa Ası Ayn Ags Qu 


Nuil matrix The null matrix 0 is determined uniquely by the 
relation A0 = 0A = 0 for all A. ` 

Transpose If the rows and columns of an n x m matrix A 
are interchanged, the resulting m X n matrix is called the transpose of A. 


The transpose of A is denoted by A’. Namely, if A is given by 


Ai Are Aim 
Gi, Gag s 
A= 5 : 
Anı Ana Anm 
then A’ is given by 
anu Any Oni 
P Gi; az anz 
Aim Am Anm 
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Note that (A^ = A. We can easily verify that if A + B and AB can be 
defined (see Section 2-2), then 


(A--B)-—A'--B, (ABY = BA’ 


Conjugate matrix The conjugate of a matrix A is the matrix 
whose each element is the conjugate complex of the corresponding element 
of A. The conjugate of A is denoted by A = (4,;), where à; is the complex 
conjugate of a;;. For example, if A is given by 


0 1 0 
A-|—lI-cj —3—3j -1+4 (2-2) 
—l+j -—1  —243j 
then 
0 1 0 
ksimi +e d 
—)—-j —l -2-3 


Conjugate transpose The conjugate transpose is the conjugate 
of the transpose of a matrix. Given a matrix A = (a,;), the conjugate trans- 
pose is denoted by A’ or A*, namely, 

AAP = (à5) 
For example, if A is given by Eq. (2-2), then 
0 —1—j —1-j- 
À'—A*- | —343  —1 
0 —1—4j —2—3j 
Clearly, the conjugate of A’ is the same as the transpose of A. Note that 
(A*)* = A. We can easily verify that if A + B and AB can be defined, then 


(A + B)* = A* + B*, (AB)* = B*A* 
Note also that if c is a complex number, then 
(cA)* = cA* 
If A is a real matrix (a matrix whose elements are real), the conjugate 
transpose A* is the same as the transpose A’. 


m-rowed minor An m-rowed determinant obtainable by 
deleting n — m rows and n — m columns from an n-rowed determinant is 
called an m-rowed minor. 


Minor M;; We shall employ the notation Mi; for an (n — 1)- 


rowed determinant obtainable by deleting the ith row and jth column from 
an n-rowed determinant. 


Principal minor A minor of a determinant is called a principal 
minor, if its diagonal elements are also diagonal elements of the determinant. 


Cofactor A,, The cofactor A,; is defined by A,; = (—)'**M;;. 
The cofactor A,; is the coefficient of the term a,j in the expansion of |A|, 
the determinant of A, by minors, or 


Q;, Ay + dis Ais + +++ T Qin Ain =|A| 


In this book we denote the determinant of A by |A| or det A. If a,,, 
Qin, -- + » Qin are replaced by a5, Ajo, .-. 5 Ajn, then 


Gj Ai + djs Aye + +++ + Ajn Ain = 0  z Jj) 
because À can be seen to possess two identical rows. Hence we obtain 


» ajk Aj = Õu lA | (2-3) 


Similarly, 


> ari A; = 8,3 |A | (2-4) 


Adjoint of square matrix The adjoint of a square matrix A 
is the transpose of the matrix obtained from A by replacing each element of 
A by its cofactor. Namely, the adjoint of a square matrix A is a matrix 
B = (bj), where b;; = A;. The adjoint of a square matrix A is written as 
adj A. The element in the jth row and ith column of the product A(adj A) is 


p» Oj Dui = D ajk Aun = 85: |A| 
k=! k=1 


Hence A(adj A) is a diagonal matrix with diagonal elements equal to |A |. 
Thus A(adj A) = |A|I. Similarly, the element in the jth row and the ith 
column of the product (adj A) A is 


x bag = E Anjax: = Õu |A| 
Hence we have the relationship 
A(adj A) = (adj AMA = |A|I (2-5) 
For example, given a matrix A 
i 2 0 
A=|3 =i -2 
1 0 —3 


adj A is obtained as 


"DM 2 of 
Peuri F 
, 3—2) |n 0 1 0 
— + 1 sd l a 4 j = 
3 =] i a i 2 
| 1 | Aa ol | 3 al] 

8 « =i 

aly a 6 

|! £3 =7 

and 

i6.-3—91[3 X = 100 
hind) = |S —1 —A||7 —3  à|e16 i 9 
i walla 2 -4 001 


Nonsingular matrix and singular matrix A square matrix A 
is called nonsingular if a matrix B exists such that BA = AB = I. If such a 
matrix B exists, then it is denoted by A~!. A^! is called the inverse of A. The 
inverse matrix A^! exists if | A | is non-zero. If A^! does not exist, A is called 
singular. 


Inverse matrix The inverse matrix A~', which exists for a 
nonsingular matrix A, has the property that AA" = ÀA''À = I where I is 
the identity matrix. Thus, (A7!)'! = A. If AB = C then, A-'AB = IB = 
B = A-'C. If A and B are nonsingular matrices, then the product AB is 
a nonsingular matrix. Moreover, (AB)! = B-'A™'. This may be seen as 
follows: 


(BA) AB = B^ (A^A)B = BIB = BOB =I 
Similarly, we obtain (AB) (B~'A~') = I. Hence we verify that 
(AB)" = B'AT: 
Notice that for a nonsingular matrix A, 
A^* = (AA: Ay! = AATA = (A7))* 
Also 
(A)? = (A7) 
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This can be proved by taking the transpose of 
AAS = ATA =I 
Namely, 
(AvYA’ = AA) I =I 


it follows that (A~')’ is the inverse of A’, or 


(A) = (A? 
Similarly, we have the following relationship: 
(A*)'! = (A= 
From Eq. (2-5) and the definition of the inverse of a matrix, we have 
-1 _ adjA 
ES 


Hence the inverse of a nonsingular matrix is the transpose of the matrix of 
its cofactors divided by the determinant of the matrix. If A is given by 


Qj Gi >>> Ain 
rye Az As >>- Gan 
Anı Ang Roe Ann 
then 
An As, An 
|A] [A] IA] 
As dX, A 
dj A 8 = 
A^ = “44 —| JA] [Al EY 
EY à : 
Ain Asn Ann 


[A] [A] ~~" [Al 
where A;; is the cofactor of a,; of A. Hence the terms in the ith column of 


A^! are 1/| A| times the cofactors of the ith row of the original matrix A. 
For example, if A is given as 


[1 2 0 
A=|3 —1 —2 
1 0 —3 
then adj A and |A| are obtained respectively as 
3 6 —4 
adjA=|7 —3 2]; |A| 2 17 
1 2 —7 
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Hence the inverse cf À is given by 


3 & 27 

17 17 17 
As234A.|7 3 2 
=A] VI 17 17 

l 2 al 

Jf 17 17 


Formulas for finding inverse matrices are given below for a 2 x 2 
matrix and 3 x 3 matrix. Fora 2 x 2 matrix A where 


k= JF ad — bc +0 


[^ 


the inverse matrix is given by 


aq b e 
Azide fh [A] #0 
g h i 
the inverse matrix is given by 

e b. b c 
h i -|, i eg f 
gt ae "4 g el je € 
[Al| |g i eil la f 
de a.b a b 
g h| lg i d e 


The inverse of A can also be obtained by a different method which is 
given now. (Another method is given in Section 5-2.) 
Consider n simultaneous equations in X,, Xs, .. . s Xn- 


a4,X, F aX F oe + + aX = b, 
Gy, X + yg Xp + +++ + aX = dy 
An ,X, 4- AnoXe + MR a AnnXn —— b, 


or in the form of the vector matrix equation 
Ax = B 


Solving for x, we obtain 


X, = Cab, + eb, + -+ Cindy 

Xa = Caibi + Coby + ++. + Condy 

Xn = Cab, + Cab, + +++ + CanDn 
or 

X, Ci Cig Cin | [ b; 

Xa | | Car Cz Con b, 

Xn Cni Cn Cnn J LOn 


which is equivalent to 


X = A`'B 
Hence 
Ci Cis Cia | 
A^! c: Coy Coo Con 
Cni Cn» mem Cun 
This method is practical if it is possible to obtain x,, x», .. . , x, in terms of 


b, bs, ..., b, without much computational efforts. 


Symmetric matrix and skew symmetric matrix A symmetric 


Al =A or ay = d; 
Ifa matrix A is equal to the negative of its transpose, namely, : 
A' = —A or ay = —d;j 


then it is called a skew symmetric matrix. 
If A is any square matrix, then A + A’ isa symmetric matrix and A — A’ 
is a skew symmetric matrix. For example, if A is given by 


then 
” E 
A4-A'—|6 10 14| = symmetric 
10 14 18 
and 
0 —2 —4 
A— A'= |2 0 —2| = skew symmetric 
4 2 0 


Notice that if A is a rectangular matrix, then A’A is a symmetric matrix. 
Notice also that the inverse of a symmetric matrix is symmetric. To establish 
this fact take the transpose of AA^! = I. We have (A^')'A' = I’ = I. Noting 
that A= A’, we have (A7')'A’=(A7')'‘A=I=A™'A. Hence, A^ 
= (A“')’. Thus, the inverse of a symmetric matrix is symmetric. 


Orthogonal matrix A matrix A is an orthogonal matrix if it is 
real and satisfies the relationship A'A = AA’ = I. (This implies that | A| = 
+1 and A is nonsingular.) In the orthogonal matrix the inverse is exactly 
equal to its transpose. 


A= A’ 
Examples of orthogonal matrices are 
; 0.6 08 0 
cosÓ  —sin6 
es : r B = |—0.8 06 0 
sin 6 cos 0| 
i 0 0 | 
To demonstrate, let us compute A`! and B^! 
cos@ sing 9s -08 R 
A~ = : =A’, B^ = |0.8 0.6 0| =B 
—sin@ cosÓ 
0 0 i 


If A and B are n x n orthogonal matrices, then so are A~', A’, and AB. This 
may be seen as follows: Since A is orthogonal, AA’ = I and (A’)’A’ = I. 
Hence A’ is orthogonal. Since A^! = A’, A^! is also orthogonal. Since 
B' = B-!, A’ = A^!, (AB) = B'A' and (AB)! = B^'A*!, we have (ABY 
= (AB)'!, so AB is orthogonal. 


Hermitian matrix and skew Hermitian matrix A matrix whose 
elements are complex quantities is called a complex matrix. If a complex 
matrix A satisfies the relationship 
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A*—A or 0, = Gj 


where d,, is the complex conjugate of aj, then A is called a Hermitian 
matrix. An example is 


"em | 1 4 Pa 
4 — j3 2 
If a Hermitian matrix A is written as A = B + jC, where B and C are real 
matrices, then 
B = B', C z= —C' 
In the foregoing example 
1 4 0-3 
A=B+jC= j 
FJ | 4 | EJ i 4 


Notice that the inverse of a Hermitian matrix A is Hermitian, or A^! 
= (A^!)*. Notice also that every square matrix can be expressed uniquely as 
A = G + jH, where G and H are Hermitian and are given by 

— ui * - waaay 
= (A + A*), ml A*) 
The fact that G and H are Hermitian is seen as follows: 
Gt = (A* -A) - G 
—1 
H* = — * — = 
2j (A A)=H 


We can easily verify that for n x n Hermitian matrices A and B, the 
matrices A + B, A — B, AB + BA are also Hermitian. The product AB is 
Hermitian, however, if and only if A and B commute, since AB = A*B* 
= (BA)*. The determinant of a Hermitian matrix is always real, since 


|A| =|A*|=|A’|=[A| 


If a matrix A satisfies the relationship A* = —A, then A is called a skew 
Hermitian matrix. An example is 


u | js —2+ 3 
2 Q-j j | 
If a skew Hermitian matrix A is written as A = B + jC, where B and C are 
real matrices, then 

B = —B', C=C 
In the present example, 


0 —2 
A-sejc-[ | dj 
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Unitary matrix A unitary matrix is a complex matrix in which 
the inverse is equal to the conjugate of its transpose. Namely, 
A^! = A* or AA* = A*A =I 


An example of a unitary matrix is 


1 " 1 r 
et) zm +] 


=g ua " 
-(—3 2— 
vis‘ +j) Wah J| 
To demonstrate, let us compute À^' and A*. Since for a unitary matrix A 


the determinant of A is equal to one or | A | = 1, we obtain 


1 ; 1 ; 
m;0-2 -75C +D 


The conjugate transpose A* is given by 
1 


D VU eee 
MEC uds a 
jae p 2048 
A/ 15 A/15 
Hence we verify that A^! = A*. 


Orthogonal matrices satisfy the relationship, AA* = A*A = I; hence 


they are unitary. 
Notice that if A is unitary, then the inverse A^! is also unitary. To see 


this note that since AA* = A*A = I, we obtain 
(A*)A7! = (AAT) = I 
A^ (A*)"! = (A7YA-)* a | 


If 1 x n matrices A and B are unitary, then AB is also unitary. To prove 
this, notice that since AA* — A*A — I and BB* — B*B — I, we have 


(AB)(AB)* = ABB*A* = AA* = I 
(AB)*(AB) = B*A*AB = B*B = I 
Hence it is shown that AB is unitary. 


Normal matrix A matrix which commutes with its conjugate 
transpose is called a normal matrix. Namely, for a normal matrix A 


AA* = A*A (if A is a complex matrix) 
AA’ = A'A (if A is a real matrix) 
Notice that if A is normal and U is unitary, then U-'AU is also normal, since 
(U^ AU)(U-!AU)* = U^'AUU*A*(U-))* = U-'AA*(U-)* 
= U*A*AU = U*A*(U-))*U-!AU = (U-!AU)*(U-!AU) 


A matrix is clearly normal if it is symmetric, or Hermitian, or skew sym- 
metric, or skew Hermitian, or unitary, or orthogonal. 


Summary In summarizing definitions of various matrices, we 
give the following list for a handy reference: 


A’ = A is symmetric 
A'— —A A is skew symmetric 
AA'=A/A =I A is orthogonal 
A*=A A is Hermitian 
A* = —A A is skew Hermitian 
AA* = A*A =I A is unitary 
AA* = A*A or AA’ = A'A A is normal 


2-2 RULES OF MATRIX OPERATIONS 


This section first reviews some rules of algebraic operations of matrices 
and then gives definitions of the derivative and integral of matrices, Finally, 
rules of differentiation of matrices are presented. It will be seen that matrix 
algebra differs from usual number algebra in that matrix multiplication is 
not commutative and that cancellation of matrices is not valid. 


Sum of matrices The sum of two n X m matrices A and B is 
a matrix in which each element is the sum of the corresponding elements of 
two matrices À and B. Namely, 


4, + b Qy + dy = Aim + bim 
“a Bee Qs) T ba ds T ba Am T Dam 
Anı Ru by On» T bas ee Anm + D, 


Note that the sum of matrices A and B is not defined unless the number of 
the rows of A is the same as that of B and the number of the columns of A is 
the same as that of B. 


Difference of matrices If A and B are n xm matrices, the 
equation B + X = A hasa unique solution for X. This solution, X=A—B, 
is called the difference of A and B. The difference A — B is given by 


Gy — By Ay bue oee Aim — Bim | 

Gey — brr d — bo oee Glam — Dy 
A—B-— ; : : 

Gy Dig, a = Dgy ce am — Dg 


The difference of matrices A and B is not defined unless the number of the 
rows of A is equal to that of B and the number of the columns of A is equal 


to that of B. 


Multiplication of matrix by scalar The product of a matrix 
and a scalar is a matrix in-which each element is multiplied by the scalar. 


Namely, 


ka; kay, cec. kaim | 

kän dip sor Kg 
kA =f i s 

kan, Kans t Kanm 


Multiplication of matrix by matrix Multiplication of a matrix 
by a matrix is possible between matrices in which the number of the 
columns in the first matrix is equal to the number of the rows in the 
second. Otherwise, multiplication is not defined. 


[än di c5 dya Dig Diy 5 Dig 

Ge, 0, c0 dm || Ba, Da +++ 5s 
AB — E $ : : . 
Qn, Ano Anm bmi Ome bs] 
Cir €i? Cir 
Cor Cas Cor 
Cni Cno zx Cnr 
Y agb 
Ci = Q;; jk 
where ik T ij 


Thus multiplication of an n x m matrix by an mxr matrix yields ann x £ 
matrix. It should be noted that, in general, matrix multiplication is not com 


mutative 
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AB =~ BA or p abs, Æ X b;;a5; 
ja j=1 
For example 


AB — » ^i p "x - beg Tas anbat isba 
as Are} (b,, b a,b, + ds b, a3, b,, + ass bzz 
and 
BA — ^s zi » a x po + 5144, bira, + Dira 
be, ba G2, Azs 5,4, + bs; Qo, b, a, + bs; 05; 
Thus, in general AB + BA. Hence, the order of multiplication is significant 
and must be preserved. If AB — BA, we call the matrices A and B commute. 
in the preceding matrices A and B if, for example, a,, = a,, = b,, = b,. = 0 
then A and B commute. i E : 
For n X n diagonal matrices A and B 
a,b, 0 
ds; boy 


AB = (a, 86,(5,, Si) = 


0 Gabi 
If A, B, and C aren X m matrix, m X r matrix, and r X p matrix, re- 


spectively, then the following associativit 
; y law holds true: (AB) C — 
This may be proved as follows: (AB) C = A (BC). 


(i, k)th element of AB = >> a;by, 
j=1 

(J, h)th element of BC = Y 5,c,, 
k=l 


i, h)th el =3($ SSE 
(i Ath element of (AB)C = $ (Z ayb) cm = È Duy bpn 
= zz a, (bj cu) = E au( Eben) 
= (i, A)th element of A(BC) 
Since associativity of multiplication of matrices holds true, we have 
ABCD = (AB) (CD) = A(BCD) = (ABC)D 
Artn — AnA” (m,nz1,2,3,.. a 
If A and B are n X m matrices and C and D are m X r matrices, then the 
following distributivity law holds true: ' 


(A + B) (C + D) = AC + AD + BC + BD 
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This can be proved by comparing the (i, j)th element of (A + B)(C + D) 
and the (i, j)th element of (AC + AD + BC + BD). 


Partitioning of matrix A matrix consisting of q x p (p < m, 
q < n) array of elements obtained from an n x m matrix A by deleting any 
n — q rows and any m — p columns of A is called a submatrix of A. It is 
sometimes useful to partition A into submatrices and regard them as its 
elements. Partitioning of A into submatrices may be made in many different 
ways. For example, A may be partitioned as follows (the partitioning is 
denoted by the vertical and horizontal dotted lines): 


| 
aii Qip | Fi pu im 
7 1 
" [j 
e ' 
! 
Qai Aap | Gaps Aam 
A m—|—-————————————————— i TLR 
Aq+i Qqeip | Qasi psi Aari m 
. 1 . 
: 1 
$ i 
I 
Ani Anp i) Qn pti 2s Anm 


The elements of A contained in rectangles formed by dotted lines are the 
submatrix elements of A. By denoting the submatrices of this example by 
Ain Ain Ao}, and A,,, we obtain 


| Ay, i Ais | 
A = |----4 : -—--— 
An, E 


where 

[d aip 

A = : | =q X p matrix 
-Aq Aap 
LT o lm 

Aum]: |: | =q X (m — p) matrix 
Ldqpei °°" qm 
LT TES sip 

A c| : : = (n — q) X p matrix 
Lani =e Anp 
Diaper ee c wm 

Asc : : = (n — q) X (m — p) matrix 
Lan pri ace © Cnm 


Carr out submatrix multi ication 
l t 


where 

by by 

B, : —px smatrix 
by, bys 
b ua bir 

B.-—|: : — p X (r — s) matrix 
b, $*1 by, 
Bos L Bye s 

Bum]: — (m — p) x s matrix 
Da exu Dans 
D oki 5*1 by Í 

B., = 4 ; 

22 à : | n — p) x (r — s) matrix 
bn S+1 CE Ni D 
then 


AB — PEN ps sce a A By, 
A, Bi, + Ay. Bo, As, Bi. + ris 


This equivalence may be seen if we i 7 
examine th c 
Assume that 1 « i X q, l <k s. Then ianiai 


i, k)th el =>) - me 
(i, k)th element of AB = p 4b, = p ajb + X issy A 
p 
= E [(é, /)th element of Ai] [(J, K)th element of B,,] 
m-p 
T 2 [(é, j)th element of As] [(J, Ath element of B4] 


= (i, k)th element of (A,,B,, + A,.B.,) 


Hence, we have the following relationships: 


Be AX Bs] [A,B, AB, + A, B,, 
9 Anlè Bal — 0 As. Boo | 


b 0 lo" 0 — AB, 0 
9 Alô BJ | 0 "4 


and 


Notice that if A,, and A,, are nonsingular, then 
An t Edd Aj —Aj'A,, As! 
ha ch a] 
and 


C Lio | An! 0 
As. As) — [ZAR An AR As 
Partitioning is useful in additions and multiplications of matrices because 


the computing labor can be lessened considerably. To demonstrate this, 
consider multiplication of two matrices. If two matrices A and B are given by 


E 212 4 : 
&-| 3 1 | 1 i| = tania 
3 0 
1 0 E 
2 d B. 
Q i 


then AB is obtained as follows: 
AB — [Ai B, ep A,» Bo,] 


“Lah d+h lb 15-6 o 


Remarks on cancellation of matrices Cancellation of matrices 
is not valid in matrix algebra. Consider the product of two singular matrices 
A and B. Take, for example, 


Then, 
29: 1 1 —2 
ately ibi a 


Clearly AB = 0 implies neither A = 0 nor B = 0. In fact AB = 0 implies 
one of the following three: 

(a) A=0 

(D B=0 

(c) Both A and B are singular 

We can easily prove that if both A and B are nonzero matrices and 


_fo _, 
zi" -ia 
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AB — 0, then both A and B are singular. Assume that if A and B are not 
singular, then |A| + 0 and therefore A-! exists. Then we obtain 


A'AB=B=0 
which contradicts the assumption that B is nonzero. Thus we conclude that 
both A and B must be singular if A z 0 and B 35.0; 
Similarly, notice that if A is singular, then neither AB — AC nor 


BA = CA implies B = C. If, however, A is a nonsingular matrix, then 
AB = AC implies B = C and BA = CA also implies B = C. 


; Derivative or integral of matrix The time derivative of a 
matrix A(t) is defined by the matrix whose (i, )th element is the time deri- 
vative of the (i, j)th element of the original matrix, provided that all the 
elements a,,(t) have derivatives with respect to t. Namely, 


s A(t) = (£ at) 


Similarly, the time integral of a matrix A(t) is defined by the matrix whose 
(i, th element is the time integral of the (i, j)th element of the original 
matrix, or 


[^a = (J ar) dt) 


provided that the a;,(r) are integrable as functions of t. 


Differentiation of matrix If elements of matrices A and B 
are functions of time, then 


(a dB 
jj (AB) = ABs Ad 
d dA dB 
£ (A 4- B) =A 

g cmm + 


Here again multiplication of A and dB/dt is, in general, not commutative. If 
& is a scalar and is a function of time, then 


da 


d _ dA 
ai (Ad) = Fra ATE 


The time derivative of A-! is given by 


W ae WR 
Q4 monet 
This may be obtained by differentiating AA-! with respect to 1. Since 
d a dA dA-! 
ipo. A BM SS AS —— 
dt A dt AEPA dt 
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d abus E = 
ar AA ag 0 
we obtain 
dA — dA, -14 dÀ ! | A71 dA 4-1 
dcm gr^ or AA d dt 


which is the desired result. 


2-3 VECTORS AND VECTOR SPACES 


This section presents definitions and brief discussions of vectors and vector 
spaces. 


Vector Let F denote the real number field or the complex 
number field. The elements of F are termed scalars. An n-dimensional 
vector x, called n vector, over a field F is an ordered set of n numbers which 
belong to F. We denote it in the form: 


The n numbers x,, X.,..., x, are called elements or components of x. Vectors 
over the field of real numbers are called real vectors; vectors over the field of 
complex numbers are called complex vectors. The vector previously denoted 
is called a column vector. The column vector may be considered as an 
n X 1 matrix. The transpose of a real column vector x is a real row vector 


denoted by 
x e xx; X] 
The conjugate transpose of a complex column vector is a complex row 
vector denoted by 
x* = [X, X, +--+ Xn] 
Note that the conjugate transpose of a real column vector is simply a real 


row vector. 
One-dimensional vectors are scalars. (Scalars are either real num- 


bers or complex numbers.) 


lIn this book all vectors imply column vectors. Any row vector is written as the 
transpose or conjugate transpose of a column vector. 


If A is an x n matrix and x is an n vector, then the product Ax is an 
n X | matrix or column vector. 


Two vectors x and y are called equal, if and only if their corresponding 
components are equal, or x, — y, (i — 1,2 = MB) 


$255... 


Null vector The vector whose components are all zero is called 
the null vector. It is denoted by 0. 


Vector space A vector Space V over a field F is defined as a 
set of all vectors over F for which the operations of addition and multipli- 
cation by scalars are defined and satisfy the following rules: 


l. For each pair of vectors x and y in V, the sum of x and y, denoted by 


X + y, belongs to V. Addition of vectors is commutative and associa- 
tive, namely 


X+y=y+x 
&+y)+z=x+(y+z2) 


2. For each pair of vectors x and y in V there exists a unique vector z 
in V such that x + z = y. In particular, there exists a unique vector 
denoted by 0 such that x + 0 = x for all x in V. To every vector x in 
V, there corresponds a unique vector —x such that x + (—x) = 0. 

3. For each vector x in V and each scalar c in F the product cx belongs to 


V. Multiplication of vectors by scalars is associative and distributive, 
namely, 


(c,c)x = ¢,(cx) 
c(x + y) = cx + cy 
(t + ox = ex CoX 
The product of the identity matrix I and a vector x is x itself, or 
Ix =x 


The foregoing definition implies that a vector space is a set of vectors closed 
with respect to vector addition and multiplication of vectors by scalars. 

If F is the field of real numbers, V is called a real vector space. Similarly, 
if F is the field of complex numbers, V is cailed a complex vector space. In 
this book all. vectors and scalars treated are assumed to be elements of a 
complex vector space unless the contrary is stated. 


Linear dependence and independence Vectors X, XS onis s IR 
are said to be /inearly independent if 
€X, + ex, +--+ + CrXn = 0 


T? pop umpe 

where c,,€,,...,c, are constants, implies that c, = c, = Ča ; 

Conversely vectors X;, Xa - - - , X, are said to be linearly dependent if and only 

: - . . . sans ? » 

if x; can be expressed as a linear combination of x,(j=1,2,...,n; 
1 


js i. 


Theorem If vectors x,, X, ..., X, are linearly independent 
and vectors X,, Xo, » Xa, Xn+; are linearly dependent, then x,,, can be 
: x A ELE t ; i 
expressed as a unique linear combination of X,, Xj, .. . , Xn. 


Proof Since vectors x,, X, ... , Xn+, are linearly dependent, 


CX; =i Ca Xa rm ih sp Cn X5 T CneiXns 41 = 0 
where not all c; are zero. Clearly c,., 40. (Otherwise, the vectors Xi 
x X, become linearly dependent which contradicts the assumption that 
294 t3: 
they are linearly independent.) Hence, 


X, — X, 


Cn+1 Cn«1 Cust 


€ 
Cy C» Jasna eat ee 
Xn+1 Xi 


Thus, x,,, can be expressed as a linear combination of x,, X,, ..., x,. The 
5 n 


uniqueness of this expression can be proved as follows: Suppose that there 
are two expressions 


n " e n hs. 
Xia = È GX = BCX 


then clearly 


n 
à (ci — ci)Xi = 0 
iz! 
Since vectors X,, Xs, . .. , X, are linearly independent, 
Gi — g =0 
t 


Namely, 
q=e¢ = (l<ign) 

This proves the uniqueness of the expression for x,.,. | á. T 

It can be proved that the necessary and sufficient conditions for z 
vectors x; (i = 1, 2, ..., m) to be linearly independent are that 

lmn : 

2 "There exists at least one nonzero determinant made of m columns 

then x m matrix whose m columns consist of X,, Xs, ..., Xin 
Hence for n vectors X, Xə, ... , Xn, the necessary and sufficient condition for 
linear independence is 
|A| z5 0 

where A is the n x n matrix whose ith column is made of the components of 
38 (es, Bes 5). 
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Basis A set of vectors Xi Xy... , Xn Of a vector space V is 
called a basis of V if these n vectors are linearly independent and every vector 
y of the vector space V can be uniquely expressed as a linear combination of 
the vectors x, X,,. . . , Xn, namely, 


n 
y= Fan 


where the numbers a, are the coordinates of y relative to the basis x,, x. 
....X,. We shall denote the basis Xi, Xs... , X, by the symbol 


> 
W = (Ei Kien Xa 


Finite dimensional vector space A vector space V is called a 
finite dimensional vector space if the number of elements in its basis is finite. 
The number of elements in a basis of a finite dimensional vector space V is 
called the dimension of V. In this book we denote n-dimensional vector space 
by V" or simply by V. 

For a given vector space V a basis is not unique. In an n-dimensional 
vector space any basis has n elements. Conversely, any set of n linearly 
independent 7 vectors is a basis of an n-dimensional vector space. For exam- 
ple. the following vectors 


x,- |o 3 x, = ] y Xg = 0 
Lo! 0 1 


constitute a basis of a three-dimensional vector space. The following three 
vectors, however, 


[1] 2 2 1 
en » = I X, = 1 
Lo] 1 L1 


do not constitute a basis of a three-dimensional vector space, since they are 
not linearly independent. 

df any vector Y of an n-dimensional vector space V can be expressed as 
à linear combination of a set of vectors Xi, X), ... , Xm, then the set of vectors 


is said to span the n-dimensional vector space V. Consider the following set 
of n vectors: 


l 0 0 

0 l 0 
a , X, = , s Xa = 

0 LO 1 
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Since any vector y where 


Yn 
can be written as 
y — yix, + ysXs spese + YnXn 


the set of vectors X,, X.,..., X, spans n-dimensional vector space V. 


Subspace Any nonempty set S of vectors of V is called a 
subspace of V if itis closed with respect to vector addition and multiplication 
of vectors by scalars. Namely, if any pair of vectors x and y belongs to S, 
then x + y also belongs to S; and if any vector x belongs to S and c is any 
scalar, then cx also belongs to S. Hence a subspace always contains Ox = 0. 
A subspace of a vector space is itself a vector space. 

For any set of vectors x,, X4, ... , Xm the set of all their linear combina- 
tions constitutes a subspace, and the subspace is spanned by them. If a set of 
vectors spans a subspace and if in addition these vectors are linearly inde- 
pendent, then the set of vectors is called a basis of the subspace. A basis of 
a subspace can always be found from a set of vectors which spans the sub- 
space by choosing a maximum number of linearly independent vectors from 
the set. Notice that the number of vectors in every basis of a subspace is the 
same and is equal to the dimension of the subspace. 


Linear transformation Consider a linear transformation 
z= Ax 
where X = m vector 
z = n vector 
A =n X m matrix 


The matrix A transforms a vector x in an m-dimensional vector space into 
a vector z in an n-dimensional vector space. The important problem in 
connection with linear transformations is the problem of inversion, the 
question of the existence of a solution to a system of linear equations. We 
shall discuss this matter in Chapter 7. 


Inner product Any rule, which assigns to each pair of vectors 
x and y in a vector space a scalar quantity (x, y), is called an inner product or 


scalar product in the vector s i i 
| pace provided that the fi 
are satisfied: banca: 


1. (y, x) = (x, y) 
where the bar denotes the conjugate of a complex number 
2. (ex, y) = e(x, y) = (x, ey) ; 
where c is a complex number. 


3. (Xy, Z + w) — (2) + (y, DE 
4. (x X) »0 forx #0 EU kia des 


axioms 


In any finite dimensional vector space, there are many different definition f 
inner product, all having the properties required by the definition is 
i Pt sient the contrary is stated, we shall adopt the following 

einition of the inner product: The inner product of a pair of n vectors x and 
y 1n a vector space V is given by 


(x, y) = XQ + Aer cR Zn = n Xii 
i=l 
= complex number 


Mes the xX; are the complex conjugates of the x;. This definition clearly 
satisfies pe four axioms. The inner product can then be expressed as 
(x, y) = x*y, where x* denotes the conjugate transpose of x. Also 


(x, y) = (y, x) = y* x = yx = x*y 
where y' is the transpose of y. 


The inner product of two n vectors x an 


> d y with re : 
therefore given by 7 al components is 


n 


ir XnYn = DS Xiyi 


i=l 


(x,y) = x,y, + Xoo t + 
In this case clearly, 


GG y) — x'y — y'x (for real vectors x and y) 


The real or complex vector x is said to be normalized if (x, x) = 1 
he should note that for an n vector x, x*x is a nonnegative scalar but 
Xx" Is an n X n matrix. Namely, 


tyf By E vs - 
b duo dil XX, + XX, ot 4 X2Xa 
c PTT "E - ^ 
- xf + [xf 4: [xf 
and 

XX; XF, X Xa 

xx* = Sis Xo X3 ial y XX, 

XaX, e + 36x. | 


Notice that, for a complex n x n matrix A and complex s vectors x and 


y, the inner product of Ax and y has the property that (Ax, y) = (x. A*y), 
where A* is the conjugate transpose of A. 

For a real n x n matrix A and real n vectors x and y, the inner product of 
Ax and y has the property that (Ax, y) = (x, A'y), where A’ is the transpose 
of A. 


Unitary transformation If A is a unitary matrix (A^! = A*), 
then the inner product (x,x) is invariant under the linear transformation 


x = Ay, because 
(x, X) = (Ay, Ay) = (y, A*Ay) = (y, A"! Ay) = (y, y) 


Such a transformation x = Ay,-where A is a unitary matrix, that transforms 


n n 
X x into Y j;y, is called a unitary transformation. 
i=l i=! 


Orthogonal transformation If A is an orthogonal matrix 
(A`! = A’), then the inner product (x, x) is invariant under the linear trans- 
formation x — Ay, because 

(x, X) = (Ay, Ay) = (y, A’Ay) = (y, A Ay) = (y. y) 


7i 


Such a transformation x = Ay that transforms >) x; into Y, y? is called an 


i i=l 


orthogonal transformation. 


Norins of vector Once we define the inner product we can 
use this ianer product to define norms of a vector x. The concept of a 
norm is somewhat similar to that of the absolute value. A norm is a function 
which assigns to every vector x in a given vector space a real number denoted 
by || x || such that 


1. xi > 0 forx 4 0 
2. ||x|| = 0 if and only ifx = 0 
3. ||ex|| = | a] [|x| 


where a is a scalar and |a | is the absolute value of a. 
4. lx+ yl & lIxl| a llyl| forall x and y 
Je y s«lixilllyl] (Schwarz inequality) 

Several different definitions of norms are commonly used in the literature. 
In this book, however, we shall employ the following definition: A norm of 
a vector is defined as the nonnegative square root of (x, X), or 


lx] = (x, 39'^. — [x*x]^ = A/D P E [xe]? t or ul 


If x is a real vector, the quantity || x | can be interpreted geometrically as the 
square of the distance from the origin to the point represented by the vector 


X. 


Cn 
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The five properties of norms listed earlier may be obvious, except perhaps 
the last two inequalities. We can prove these two inequalities as follows: 
From definitions of the inner product and norm, we have 

lax + y|? = (AX + y, Ax + y) = (Ax, Ax) + (y,Ax) + (AX, y) + (y, y) 
= ANIKI + AG. x) + ACK, y) + Ily 
= ADV |IX|I? + (x, y)] + AY) + Hy |? > 0 
If we choose 


IEA for x 0 
then 
AG; y) + ll yllt = -SPEV t llyllt >0 
and 


IXIP HY I > (x, NES) = |x, yr 
For x = 0, clearly, 


for xz 


lixe lly I? = Lx, y) 
Therefore, we obtain the Schwarz inequality, 
IG y)I « Ilxil [Ly] 
By the use of the Schwarz inequality we obtain the following inequality 
Ix + yl llxll + HLyll 
This can be proved easily, since 
lx+ yl — (x - y, x 4- y) 
= (x, x) + (x, y) + (y, x) + (Y, y) 
= [IxIl* + (x, y) + GG y) + Ily] 
= [IxIl* + Ily]? + 2Re(x, y) 
SIIXIP + lly |? + 21 (x, y)| 
S IIxI + Iyl? + 2 [1 xi yl 
= (IxII + lly |)? 
Three other definitions of norms commonly used in the literature (2-5)! 
are given: 
(1) A norm || x || may be defined by 
[x || = (Tx)*(Tx)!^ = [x*T*Tx]4 = [x*Qx]^ 


n n 1/2 
= [È š uan] 2o 


Numbers in parentheses refer to the list of references at the end of this chapter. 
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The matrix Q — T*T is Hermitian (since Q* = T*T = Q). The 
norm ||x|| = [x*Qx]"? is a generalized form of [x*x]'^ which can 
be written as [x*ix]'”. 

(2) A norm: may be defined as the sum of the magnitudes of all the 


components x,, or 
Ixil — 3 bx! 
i=l 
(3) A norm may be defined as the maximum of the magnitudes of all the 
components X;, or 
|x|] = max(] x; lj 
Among many definitions of norms, the norm [x*x]'? is most commonly 
used in explicit calculations. The concept of norms is useful to prove con- 
vergence of series. It can be shown that the various norms just defined are 


equivalent. To prove convergence of series we may use any of the foregoing 
norms. 


Orthogonality of vectors If the inner product of two vectors 
x and y is zero, or (x, y) = 0, then vectors x and y are said to be orthogonal to 
each other. For example, vectors 


[1 0 1 
"n Xs 210. xX, = |—1 
0 j i 0 
are orthogonal in pairs and thus form an orthogonal set. In an n-dimensional 
vector space, vectors X,, X.,..., X, defined by 


E 0 0 
0 1 0 
0 0 0 
x, = , x, = i > Xn = 
Lo 0 1 


satisfy the conditions (x,, x;) = 9, or 
(xax) =0 (iJ) 
(x, Xj) = ] 
where i, j = 1, 2, ... , n. Such a set of vectors is called orthonormal, since the 


vectors are orthogonal to each other and each vector is normalized. A 
nonzero vector x can be normalized by dividing x by ||x ||. The normalized 


vector x/||x]| is a unit vector. Unit vectors Xi, X4,..., X, form an orthonormal 
set if they are orthogonal in pairs. 


Consider a unitary matrix A. By partitioning A into column vectors 
A,, A», ..., An, we have 


A*A = [TETA As LAS] 


A*A, AA, AM AF A, 


ATA, AfA, --- AFA, 

ATA, ArA, py" ATA, 

oe weg 

"b ete | 
z | 

0 0 l i 


it follows that 
ATA; = (A, A) = 1 
AFfA; = (A;, A;) =0 (=j) 


bis ies see E 2 column vectors (or row vectors) of a unitary matrix A 
re orthonormal. The same is true for orthogonal matri i 
> atrice: 
en. g ices, since they are 
: IE that a real (complex) matrix is orthogonal (unitary) if and only if 
its columns or rows form an orthonormal set of vectors. Notice also that if 
nonzero vectors form an orthogonal set, then they are linearly independent. 


If a vector x of an n-dimensional i 
; vector space V is orthogonal 
of a basis of V, then x = 0. : it 


, Dual space and dual basis The vector space V’ consisting of all 
pie functionals on Vis called the dual vector space to V. If a set of vectors 
B = {X,,%, ..., Xa} is a basis of an n-dimensional vector space V, then 


T exists a uniquely determined basis Z' = [Yn Yo ..., Yn} in V’ such 
that l » 


(Ys X) =ð; (j=l, Bs Seo) 


The basis Z' is called the dual basis of 2. The dual vector space of an n- 
dimensional vector space is also n-dimensional. (Notice that the vectors 
Yo Yo, --., Yn are linearly independent and span the n-dimensional space 
V'. Hence they constitute a basis of V’.) 
Any basis has a unique dual basis. The n components of y, (i = 1, 2,..., 
n) namely Ji Jas -.., Yao can be determined by solving the following 
algebraic equations: 
JuXu + JaXn + +e PuXa = Òn 
JuXis + JaiXes rco + YniXns = Si 
YuXin T PaiXan E sae A zb JaiXan me ài 
(i12... aam) 


Notice that in the foregoing n simultaneous equations in Ji Pa, o., Pus 
the coefficient determinant is nonzero, or 


Xu Xe cts Xi 
Xig Xa 77 Xna 
#0 
Xin Xen ttt Xan 
since the rows of this determinant consist of X,, X.,..., X, which are linearly 


independent. Hence n components of y, can be determined uniquely. Thus 
the dual basis can be determined by the process of finding an inverse matrix 
and taking its conjugate transpose. This process is further explained in the 
following: Consider a matrix A consisting of xj, Xa, .. . , X, which constitute 
a basis of an n-dimensional vector space. 


Xy y tee Xin 
Xor Xas 000 Xon 
À = 
Xni Xn * ` Xnn_] 
Consider next a matrix B whose columns consist of y,, y, .... Ya which 
constitute the dual basis of the basis x,, Xs, .. . , Xn- 
Yu Vie Yin 
Yor Ys Yon 
B == H 
s Jue SOS Pimi 
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Then the product B*A becomes as follows: 


B*A = ( Fury) =C) - 1 
Hence, we see that B = (A^?)*. 
For an orthonormal basis x,, X,, ..., Xn, namely, (x, xj) = 9, the 
basis is itself its own dual basis. 
Any vector z of an n-dimensional vector space V can be uniquely ex- 
pressed relative to a basis Z = (x,, X,,..., Xn} as follows: 


= » (ys Z)X; 


where Yi, Ye, ... , Yn are the dual basis of a basis xi, X,,..., Xn. This can be 
easily proved. Since z can be represented as 


z-— ux 
t=1 
the inner product of y; and z is given by 


(Yi, Z) = (v. 2 aX) c 
Hence, 


2-4 DEFINITIONS OF TERMINOLOGIES IN 
MATRIX ANALYSIS 


In this section, we review commonly encountered terminologies in matrix anal- 
ysis. Specifically, we review definitions of eigenvalues, eigenvectors, norms, 
definiteness and semidefiniteness of quadratic.forms and Hermitian forms, 
elementary transformations, the rank of a matrix, and similarity of matrices. 


Eigenvalues of square matrix Let A be an n x n matrix. The 
determinant 


|A — AI | (I = identity matrix) 


is called the characteristic polynomial of A. It is an nth-degree polynomial in 
A. The characteristic equation is given by | A — AI| = 0, or 


d —X Ayo "p Qin 


5, Gym eei Gon 


|A — Al] = 


Any Ano E Gan —X 


= (J (A + ATI 64M + Cad + Ca) = 0 
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The n roots of the characteristic equation are the eigenvalues of A. They are 
called the characteristic roots. Notice that a real n X n matrix A does not 
necessarily possess real eigenvalues. But since | A — AI| = 0 is a polynomial 
with real coefficients, any complex eigenvalues must occur in conjugate 
pairs, namely, if @ + jB is an eigenvalue, then œ — JB is also an eigenvalue 
of A. Notice also that for a diagonal matrix or a triangular matrix, n diagonal 
elements are the n eigenvalues of the matrix. 
If we define the eigenvalues of A as A,(A), and those of A^! as [JL (A ^), 
n 
: AP(AÀ) = A GE gs staat) (2-6) 


Namely, if à; is an eigenvalue of A, then A;! is an eigenvalue of A^'. For 
example, if A is given by 
1 8 
aml al 


then the eigenvalues of A are found from 


1— 3 
=(A+1Xà—4)=0 
2 2—X ( 
or 
X = eb Xs =4 
Hence, 
URN ae Mense ax | NT DRE 
^ ( ) is c e , 2 Xo 4 
The inverse of A can be found as 
cL E 
i 2 4 
aliis Ix 1 
19i 4 
The eigenvalues of A^! can be obtained from 
—i—u 3 
à * |-Qu-1u-2-0 
1 mp cu 
z q— 7 
or 
þa = exl, Ho = + 
Hence, 


(A) =, = —1, pA) = He = 4 
Thus we have demonstrated that Eq. (2-6) holds true. 
Finally, it is noted that it is possible to prove that, for two square matrices 
AandR 


[4B — AI| = |BA — AI| 
(For the proof see Prob. A-2-29.) 


i s Wege. » ica matrix Any nonzero vector x, such that 
kN areas zie Gun Pia ector associated with an eigenvalue A; of an 
homo ; components of x, are determined from n linear 
geneous algebraic equations within a constant factor, if x, is an ei 
vector, then, for any scalar œ + 0, &X, is also an eigenvector. The daa 


IS called a normalized eigenvector if the len th Or ab 


EN Norms of matrix The concept of norms of a vector can be 
ide to matrices. There are several different definitions of norms of 
matrix. Some of them follow: j 


(1) A norm | A|| ofan n x n matrix A may be defined by 


A|| = mink 
such that 


I| Ax || < & || x [I 
For the norm [x* x]'^, this definition is equivalent to 


l| A] = max[x* A* Ax; x" =l} 


which means that ll Al] 1S the maximum of the absolute value of the 


(2) A norm ofann x n matrix A may be defined by 


IIAIl — E X lau] 


where |a;;| is the absolute value of dij. 
(3) A norm may be defined by 


" " nin is 1/2 
IAI = (È € laut) 
(4) Another definition of a norm is given by 
All = max {33 Jas} 
j=l 


Norms ofan n x n matrix A have the following properties: 


(1) IAI = I[A* JI 
(2) A+B] « [I A |] + [B [I 
(3) I ABI] « || A || || B] 


(4) Ax] « I| A 1 [Ix]! 


Quadratic form, bilinear form, and Hermitian form For a real 
symmetric matrix A and a real n vector x the following form: 
n n 
X'Ax — 2 >» Qj XiX) (a; = aij) 
is called a real quadratic form in x;. x/Ax is a real scalar quantity. 
For an n x m real matrix A, a real n vector x, and a real m vector y, the 
following form: 


X'Ay — >; > Qi Xi Vj 


i=1 j=l 


is called a real bilinear form in x; and y;. x'Ay is a real scalar quantity. 
For a Hermitian matrix A and a complex n vector x, the following form 


x* Ax = zx 0; X;X;j (aj; = à) 
i=] j= 
is called a Hermitian form. Notice that the scalar quantity x*Ax is real, 
because 
X*Ax = X'Ax = (x'Ax)' = X'A'x = x* Ax 

Among the three forms defined here, the quadratic form and Hermitian 
form are most important in the study of stability analysis of dynamic systems 
(see Chapter 8) and optimal control systems (see Chapter 9). In this book we 
shall therefore focus our attention mostly on the quadratic form and Her- 
mitian form. 


Definiteness and semidefiniteness A quadratic form x’Ax 
where A is a real symmetric matrix (or a Hermitian form x*Ax where A is 
a Hermitian matrix) is said to be positive definite if 


x’Ax > 0 (or x*Ax > 0) for x ze 0 

x’Ax = 0 (or x*Ax = 0) for x = 0 
x’Ax (or x*Ax) is said to be positive semidefinite if 

X'Ax > 0 (or x*Ax > 0) forx #0 

X'Ax = 0 (or x*Ax = 0) forx = 0 
X'Àx (or x*Ax) is said to be negative definite if 

x'Ax < 0 (or x* Ax < 0) for x 40 

x'Ax = 0 (or x*Ax = 0) forx = 0 
x’Ax (or x*Ax) is said to be negative semidefinite if 

x’Ax <0 (or x*Ax < 0) forx ~0 

Ax '0 (or x*Ax = 0) forx — 0 


80 REVIEW OF MATRICES AND VECTORS 


If x'Ax (or x*Ax) can be of either sign, X'Ax (or x*Ax) is said to be 
indefinite. 

Necessary and sufficient conditions for the quadratic form x'Ax (or 
Hermitian form x*Ax) to be positive definite, negative definite, positive 
semidefinite, or negative semidefinite were given by T. T. Sylvester. We shall 
postpone detailed discussions of Sylvester's criteria to Section 5-5. 


Linear dependence and independence A set of k columns 

An Às..., Ay of an n x m matrix A is linearly independent if and only if 
the equation 

€, À, + CÅ, + wee +c,A,=0 
implies 

Ses S ume Se, = 0 

If there exist elements c,, not all zero, such that 

Cy; Ay + cA + --* + Cy Ay = 0 


then a set of k columns of a matrix A is linearly dependent. For example, 
the columns of the matrix 


1 4/2 D 
A= |0 0 0 
0 Lt yB 
are linearly dependent, since 
Y] A2) = 6 Tr «fe 
(=T) m 0 Mz: 0 |-0 
0 1 "i3. 0 


If k rows (or columns) of a matrix are linearly independent, then any 
subset of i rows (i < k) is linearly independent. If a set of k nonzero rows 
(or columns) of a matrix À is linearly dependent, then there exists a maximum 
subset of i rows (or columns) (i < k) which is linearly independent. The 
remaining k — i rows (or columns) are linear combinations of i linearly 
independent rows (or columns). Consider the following matrix: 


1,2. 3 4 
Ma On ky =) © 
ro i 2 
LA 0 2 


The four columns of A are not linearly independent, since the following 
equation: 
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0 1 —1 0! |0 
€ 1 + Cs 0 T6 1 + C4 2 0 
1 1 0 2 0 


can be satisfied by the nonzero c;. To demonstrate this let us solve the fol- 
lowing simultaneous equations 
c, + 2c, + 36, + 46, = 0 
oiim iO 
c, +c + 2c, — 0 
ci 0 + 2c, = 9 
which yield 
à = 1.5, c, = 055, 6S 105, g=] 
We therefore see that the four columns of A are not linearly independent. 
However, the three columns of a submatrix of A 


1 2 3 
9 Xy =a 
1 0 1 
are linearly independent, since 
1 2 3 0 
c,|0 dece ll|dce —1 |=| 0 
Li 0 1 0 


implies that c, = c, = €; = 0. We can easily verify that any subset of these 
three columns is linearly independent. 


Elementary transformations Suppose that an n X m matrix 
A is a linear transformation from a vector space V' to a vector space V. The 
effect of a change of basis in V is to replace A by PA where P isa nonsingular 
n x n matrix. The effect of a change of basis in V' is to replace A by AQ, 
where Q is a nonsingular m X m matrix. By choosing suitable bases for V 
and V the matrix A can be given the following simple form: 


I 0 
PAQ =|? o 


where I, is the r x r identity matrix (r <7, f < m) and the remaining ele- 
ments are zero. The matrix A can be given this simple form by performing 
a suitable sequence of elementary transformations of the following types: 


(1) The interchange of any two basi i 
e SIS V. : 
^s y ectors x, and x; in V (or y, and y; 
(2) The replacement of a basis vector x 
or x; by ex; (or y; b ; i 
a nonzero scalar. : T ititi 
(3) The replacement of a basis ve 
: ctor x; by x; + cx, (or y, i+ 
where c is a nonzero scalar. ere ae z 


The effect of elementary transformations is to carry out analogous operations 


ca led pring i TOW (or column) operations on the TOWS (or columns) of 


Elementary row or column Operations The following row (or 


) S 
column operation ona matrix A are called the element. co. 
ary row (or lumn) 


(1) The interchange of any two rows (or columns). 


(2) The multiplicati 
cr p ion of a row (or column) by a constant c where 


(3) The addition of one row (or ipli 
column) multipl 
another row (or column). ! rap a 


An n X n matrix obtained from the n X n identity matrix I by means of 


one elementary row (or colu lon i 
mn) operation is called an elen i 
1 
A few examples are ESR 


o0 X J e 0 © II 0 T 
0 1 0|, O0. 4 Oy, 9-1 ð 
1 0 0 0 0.1 00 1 


it elementary matrices are nonsingular. Notice that the inverse of an 
mii matrix is also an elementary matrix. Clearly, elementary oper- 
ations do not alter the number of linearly independent rows (or columns) 
of the matrix. 


Any nonzero n x m matrix A c 

an be reduced b i 

to a matrix of the form p inna teins 
IL 0 
0 0 


where I, is the r x r identity matrix (r X n, r < m) and the remaining ele- 
ments are zero. In the following, we shall prove the foregoing statement 
Since A is a nonzero matrix, at least one element is nonzero. We can always 
bring a nonzero element in the (1, 1)th place by means of elementary oper- 
ations. Also we can make this element equal to 1 by multiplying an ap ae 
priate constant. It is simple, by means of elementary operations ie gc vm 
all the elements in the first row and the first column except the (1, 1)th 
element equal to zero. Then matrix A becomes 


If A, Æ 0, we repeat the same process. Notice that this process will not 
affect the first row and the first column. If the same process is continued, then 
the transformed matrix finally becomes of the form 


In the preceding process, we employed only elementary operations. Therefore, 
we may write 
E. d] 


PAQ — 5 0| 


where both P and Q are products of elementary matrices. 

We shall next show that every nonsingular matrix A is a product of 
elementary matrices. Since there exist matrices P and Q such that PAQ — I, 
where P and Q are products of elementary matrices, and therefore are non- 
singular, we have A = P^! Q-'. Since P^! and Q^! are products of inverses 
of elementary matrices and the inverse of an elementary matrix is also an 
elementary matrix, P^! Q^"! = A is a product of elementary matrices. Hence 
we have proved that any nonsingular matrix is a product of elementary matri- 


ces. 


Rank.of matrix We shall first prove that the maximum 
number of linearly independent rows of a matrix A is equal to the maximum 
number of linearly independent columns. Let A be any nonzero n X m 
matrix and let P be an n x n nonsingular matrix and Q be an m x m 


nonsingular matrix such that 


PAQ — lo o 


Ü 6 


where I, is the r x r identity matrix (r < n, r < m). The maximum number 
of linearly independent rows of PAQ is r, which is the dimension of the 
r X r identity matrix. The maximum number of linearly independent rows 
of PA is the same as that of A, since P is a nonsingular matrix which is a 
product of elementary matrices. (Elementary operations do not alter the 
maximum number of linearly independent rows—or columns.) Since 


I, eat die Q,.] PE [Qu Q.»] 
0 0, | (0 0jiQ4 Qa) LO ©] 


PA =| 
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where Q,, and Q,, are r-rowed matrices and since Q^! is nonsingular, the 
m rows are linearly independent. Hence the maximum number of linearly 
independent rows of PA or equivalently A, is r. 
Consider next 
*A*Dx* —_ I, : 
OAP = l o! 
The maximum number of linearly independent rows of Q*A* is the same 
as that of A*. By a discussion similar to the preceding we find the maximum 
number of linearly independent rows of Q*A* to be r. Hence the maximum 
number of linerly independent rows of A* is r, which means that the maxi- 
mum number of linearly independent columns of A is r. Thus we have proved 
that the maximum number of linearly independent rows of a matrix is equal 
to that of linearly independent columns. 
We shall now define the rank of a matrix. A matrix A is called of rank m, 
if the maximum number of linearly independent rows (or columns) is m. 
Hence if there exists an m x m submatrix M of A such that |M | + 0 and 
the determinant of every r x r submatrix (where r > m + 1) of A is zero, 
then the rank of A is m. (If the determinant of every (m + 1) x (m + 1) 
submatrix of A is zero, then any determinant of order s (s > m + 1) is 
zero. since any determinant of order s > m + 1 can be expressed as a 
linear sum of determinants of order m + 1.) As an example, consider the 
following matrix: 


[i 2 3 4 

ac Q I —1 0 

1 9 I 2 

RE! 0 2 

A maximum submatrix whose determinant is not equal to zero is 

I 2 3 
0 1 =1 
L oO 1 


Hence the rank of A is three. 

The computation of determinants for the determination of the rank of 
a matrix may be time-consuming. The computational effort can be reduced 
greatly by means of elementary row or column operations, since elementary 
operations do not alter the rank of a matrix. The invariance of the rank under 
elementary operations can be seen as follows: Since the rank of a matrix 
A is the maximum number of linearly independent rows (or columns), ele- 
mentary operations of type (1) and type (2) clearly will not alter the rank of A. 
Since the maximum number of linearly independent rows (or columns) is 
not altered by the addition of one row (or column) multiplied by a constant 
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to another row (or column), the elementary operation of type (3) also will 
not alter the rank of A. Hence the rank of a matrix A is invariant under 
elementary operations. In order to determine the rank of a given matrix 
A, we may therefore premultiply and/or postmultiply A by a series of ele- 
mentary matrices and transform A into a simpler form with a number of 
zero elements so that the rank can be found immediately. 


Rank of product of matrices Consider an n x m matrix A 
and an m x p matrix B. Assume that the rank of A is r. The rank of the 
product AB cannot exceed the rank of A or rank of B, namely, 


rank AB « rank A 

rank AB < rank B 
To prove this notice that nonsingular matrices P and Q exist such that 
PAQ — le " 
0 0j 


J 


a-i Jei] 


where F is a matrix of rank r. Consider next 


F H 
PAB = |---|B= |-- 
0 0 
Since F has r nonzero rows, H has at most r nonzero rows. Hence 
rank PAB < r 


Since P is a nonsingular matrix 
rank PAB = rank AB 


or 


Therefore 
rank AB < rank A 


Similarly, we obtain 
rank B*A* < rank B* 
Since rank B*A* = rank AB and rank B* = rank B, we obtain 
rank AB < rank B 


Equivalence of two matrices Ann X m matrix A is said to be 
equivalent to an n X m matrix B if and only if there existan n X n nonsingular 
matrix P and an m x m nonsingular matrix Q such that A = PBQ. This is 
equivalent to saying that A and B are equivalent if and only if they have the 


I, 0 
A= 4 
el ME 


and also there exist an i 
n X n nonsingular matrix i 
gular matrix T such that , TERGUM uM 


I, 0 
B= 7 
sh gT 
or 
~ 0 ; 
=s- -1 
2E 
Hence, 


A = GS^'BT-'H = (GS-) B(T- H) 


oi SÉ. e. are nonsingular matrices, we may put GS-! = P and 
= Q. Thus, if A and B have th = 
eami a € same rank, then A = PBQ, or A and 


Note that every matrix A is equivalent to a unique n x m matrix of the 


form 
Jiu = D , 
0 0 


In other words, th i 
» the matrices J,,, for various 
t r, n, marea i 
forms under equivalence. LE opi s 


asses na matrices Let us next consider a matrix representing 
b sd rans! ormation J of the vector space V into itself. Let us define a 
ew basis Xy, Ra... , $, in V by the equations 


i, = QuXi + gX +- 4 QinX, 
x, = 21X1 + JoXa + --- + Qon X; 


Rn = qu X, + QnsXo + +++ + AnnXy 
The matrix Q = (q:;) is a nonsingular matrix. Suppose we define 
Q'-—P- (Pi) 


then 


X, = PaRa + Paaa + +++ Pra, 


Let f have a matrix A relative to the basis x,, X4, ..., x4. Then f has the 


matrix P’AQ’ relative to the basis $,,$,,. .., Xn. 
Two n x n matrices A and B are called similar, if a nonsingular matrix 
P exists such that 
PAP =B 
The matrix B is said to be obtained from A by a similarity transformation, in 
which P is the transformation matrix. Notice that A can be obtained from B 
by a similarity transformation with a transformation matrix P^!, since 
A= PBP = (PY! BE- 
A change of basis in V, therefore, replaces A by a similar matrix B. 

If A can be reduced to a similar matrix which has a simple form, then 
important properties of A can be readily observed. A property of a matrix 
is called invariant if it is possessed by all similar matrices. For example, the 
determinant and the characteristic polynomial are invariant under a similarity 
transformation as shown: 

|B| = |P?' AP| = |P~'| |A] IP] 
=]|A||P7||P| 2 |A||P7 P| 2 |A||I|= |A| 
and er" 
|B — M| = |P7'AP — M| 
= |P-'AP — P-'(ADP| 


= |P- {A — ADP| 
= |P^'||A — AI||P| 
— [A — AI||P-! ||P| 


Notice that the trace of a matrix is also invariant under similarity transfor- 


mation, or 
tt A= tr (PAP) 
But the property of symmetry of a matrix is not invariant. 

Notice that only invariant properties of matrices present intrinsic char- 
acteristics of the class of similar matrices. To determine the invariant prop- 
erties of a matrix A we examine the Jordan canonical form of A. (For 
the Jordan canonical form, refer to Section 3-1. Here we shall state only 
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that similarity of two matrices can be defined in terms of the Jordan canoni- 
cal form; and the necessary and sufficient condition that n x n matrices 
A and B be similar is that the Jordan canonical form of A and that of B be 
identical.) ; 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-2-1 Suppose that A and B are n X n matrices. Show that 
the square matrix AB can be resolved into a sum of a symmetric matrix and a skew 
symmetric matrix. 


Solution We can write 
2AB = (AB + B'A’) + (AB — B'A’) 
where 
AB -- B'A' 
AB — B'A' 


is symmetric 
is skew symmetric 


Problem A-2-2 Prove that a square matrix A possesses the inverse 
if and only if | A | Æ 0. 


Solution If |A| O, then adj A/|A | is the inverse of A, since 


also 


On the other hand. if B is the inverse of A, then AB — I and we obtain 
|A]]B| ^ |1] — 1 
Hence 
|A|+0 
Problem A-2-3 Prove that 
(adj A)(adj B) = adj BA 
where A and B are nonsingular. 
Solution 
(adj A)(adj B) BA 
= (adj A)(det B) IA 
= (det B)(adj A) A 
= (det B)(det A) I 
= (det BA) I 
= (adj BA) BA 
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Hence 
(adj A)(adj B) = adj BA 
Problem A-2-4 Vandermonde’s determinant of order n has the fol- 
lowing factorization 


1 1 1 
Xi X2 Xn 
2 2 2 
a Xa Xn = II (x;— x) 
I&icjen 
xp xen sce XE 


Verify this when the x; are distinct. 
Solution For n — 2 the given relationship is obviously true. To 
prove this relationship for n > 3, let us use induction. Assume that the given re- 


lationship is true for n — 1 and deduce that it is true for n. ni 
Expanding the determinant about the last column, we obtain a polynomial in 


x, of degree n — 1. Let us define 


1 1 1 
i Xi Xo Xn 
| = Sf (Xn) 
grob aget mes? age? 
The coefficient of the highest-degree term in x, is 
1 1 ona d 
X X2 tt Xn- 
ijo a s ap 
which is, by assumption, equal to 
(x — x;) 
Iqi<jqn-1 
Since 
f(x) =f) = +++ =fOn-1) = 0 


we can express f(x») in the following form: 


f(x) =F II (sy Xa — Gn — Xo): t: (Xa. — Xn-1) 
1<i<jqn-l 
it follows that 


fix) = II Gym 


igi<jcn 
This completes the proof. 


Problem A-2-5 Obtain the inverse of the following matrix A: | where A i 
"E; 
be OD a l A OMS H AtS 
01002, ; i linear equa- 
Ácm611axa Problem A-2-8 Consider the following n simultaneous 
= 3 
tions: 
9 00 T$ 4 | Gx, aare 9 px, =D; 
eee an Xy F dog Xa ++ + as X, = by 
Solution M PaA 
$080 =j Gai Xp + dgsXo + t + AnnXn = bn 
0100 ~a, which are equivalent to 
A~ =|0 010 —4, Ax =B 
Seyi =x where 
0000 1 Ga iy 995 Big bi ^ 
Problem A-2-6 Obtain A-! where A is given by Hg dig tell b. Xe 
= =|. |, x=|- 
1 1 1 1 dm] tae t dea B 
—fry3 3 — 1208 3 A a . . 
Aa=| ~” E * ii LEO b, Xn 
« =æ Ê —g ak d T — 
-a -a? —g: —g Assuming that |A| 0, obtain the solution of the simultaneous equations in the 
Solution form of Cramer's rule. 
AB ga Bian! 1 Solution x is obtained as 
1 8' -a -Ba l | x =A™'B 
AU = 1B? — a) A As An b, 
2(8* — a3)! gs =B -—a?B-1 —] ! ; n cn ‘ 
-a £3 giga 4j ^ Aus TAS. ES M : 
Problem A-2-7 Find the inverse of the following matrix BUE š à 
Ac 5 -$ -5 ] Ain Bon ct Ann bn 
2 2 | y b A 
A= 5 At Ss -5 —s i 2 att 
0 0 X 0 1| b, Ave 
L 6 0 0 aA ~ [Al 
Solution , 
TET Xs X si x. 2, bi Ain 
MA 4-5) UU x (X. + 5) z” e 
2 3 2 . Hence 
-—Es x(a r >) s $525 + st) ! dj c By 9 dg 
X d 1 Qo, e ba ET Aan [AO?| 
0 0 12 (12? + 12A + 4As3 + st) 0 x, = pg] b Au T [A] - . : = TAI 


0 0 0 ŠA + 12X5 + AAs + st) mr enssdb, Mus um 
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where | A9!| is the determinant obtained from A by replacing the jth column by 
the column matrix B. This gives the solution for x; (j = 1, 2, ..., n) and is Cramer's 
rule. 


Problem A-2-9 Prove that for n x n matrices A and B 


is Á — |A 4- B[*|A — B| i 
Solution | 
ls "ie ro "i "Ig. 0 |- IA  BI-1A — Bi 
B A B A B A—B 
Note that for n x n matrices A, B, C, and D the determinant 
A B 
C D 


is, in general, neither equal to | AD — BC| nor equal to |A||D| — |B|] C|. 
Problem A-2-10 Obtain 5. | A)| where 


au) a(t) +++ a) 
Q(t) asht) +++ galt) 


|A()] = 
at) aas(t) m Ann(t) 
Solution The derivative of the determinant | A(r)| is given by a linear 
combination of all minors of | A(7)| as follows: 


a(t) a(t) soe dat) 


i=l 


d n 
rau? 4 alt) 4 anit) es 2 alt) 


asit) asx(t) = SAS ast) 
a i=l 


n d 
A Ai ar e) 


Problem A-2-11 Suppose that xi, x, . . . , x, are a set of orthonormal 


vectors, or 
(x,x)=1 (i=j) 
=0 ~ Gy) 
(i,j = 1,2, ...,n) 
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then for vectors y and z defined by 
y—ax + A,X tree + anXn 


z= b,x, + bX;d see + bnXn 


show that 
(y, Z) = ā bı + Goby + +++ + Gnbn 
and 
(y, Xj) = di (i= 1,2, at) 
Solution Since 
L n m Ww LM 
(y,z) = (X aX 2 bsx,) = X 2&5 x)= X 4b 
i=l j= Ej: 
we obtain 


(y, z) = dài bi + dob; + +++ + ānbn 
By substituting z = X; we obtain 
(y, Xj) = di (= 15:2, 8) 


Problem A-2-12 For n x n matrices A and B 


tr (AB) = T x a;bj = tr (BA) 


Show that 
tr (AB/) = tr (A'B) = tr (BA) 
Solution Since tr A — tr A', we have 
Y X agb; = tr (AB) = tr (BA) 
i=] j=l 
also 
tr (BA’) = tr (A’B) 
Problem A-2-13 Show that if a scalar quantity fis defined by 
f — xAx 
then 
x'(A — A)x =0 
Solution Since f is a scalar quantity, the transpose of f is equal to f, 
Or 
x Ax = f = f' = x'A'x 
Hence, 


x (A— A)x =0 
Problem A-2-14 Given the characteristic equation 
fOr) =|A -M= (PA 4e Xt! + nn + eee Cn) = 0 


show that 
AA eee a= (—P ca = 1A] 


and 
X, XS bo RAS = tr A = (—)le, 
Solution The characteristic equation has roots Aj, A»,..., Àn- Ina 
factored form the characteristic equation becomes 
n 
Qu — MA o Aq) c Quo Nj e TECR —X)-0 
i= 


The only term which does not contain X is the term consisting of the product of 
— (i = 1,2,...,n), and it is equal to c,. Therefore, 


n 
€, =(—)* M A) 
or 
ArNe 22+ An = (—)"Cn 
To show that (—)"c, = |A|, write |A — AI] in the following form: 


Q—X 1» eee Qin 
Qo, Q3, — X An 
JA — M| = 
Anı 5» Ann — X 
4 Are Qin 
Qo, Ano Aon 
=f s . + |+ (polynomial in X of degree n — 1)X 
Ani Ana *** Amn 


Note that in the expansion of |A — AI| the only term that does not involve X is 
| &|. Hence we obtain 


IAI =(—1)"c, 
Notice that 
—Qu + Ag boots An) = (—)" 6 


Notice also that the trace of A is invariant under similarity transformation. There- 
fore, 


tr A = tr(P-! AP) =A, + Ay tees + AQ 
Hence 


Ay t Ag ters $A, =trA=(—)*'e¢, 
Problem A-2-15 Given the following n X n matrix 
—a 10 --- 0 
fie o: e iD 


show that the characteristic polynomial is 


JA — AT] = (YQ a AM? a NIU B e + an-A + Gn) 


Solution 
-G—*. 1 0 0 
=a; =% A 0 
lA —Al| = : : 
—5-1 0 0 1 
—ayn 0 0 —X 
—X 1 0 
0 —X 1 
= (—a, —) x 4 
0 0 0 
0 0 0 
TO 
—X 
; 2 0 
=(—a, —XX(—Xy* --(—Ya| . 
0 
+(-} 


—an 


= (—a, —A)(—A)* + a (=) aar 


he is wie E E dF (=) On 
— ( Y QA + a X73 AME AMS Hore H asa + On) 


0 


Problem A-2-16 Obtain eigenvectors of the following matrix: 


OF d 


0 
=O 0 
1 © 


0 


OQ » 


0 


0 
1 
0 
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Solution The characteristic equation is 


—Xr 1 0 0 
face Run 0 —2A 1 0 <2 
0 -—Xr 1 
1 0 0 —x 
Hence A; = 1, Ag = —1, A; =j, X, = —j. Let the eigenvector for A, be 


Xu 
Xai 
x; = 
Xsi 
Xu 
Xu 
Xoi 
Xsi 
X45 


Xai Ni Xir 
Xai | [Xs 
X4i Ai Xsi 


Ni X44 


Then 


= Cy SOS UO 
oor o0 
ll 


ooo 
or 00 


it follows that 


Hence the eigenvector for A; is given by 


Xu 1 

Xo; Ai 
Xsi Ai 
Xu A1 


^ 


where c, is an arbitrary constant different from zero. 


Let T be a matrix consisting of four independent eigenvectors of A. Then, an 


example of T is given by 


Low 
raph ^ ^ 
APOAPOAM 

M M M 

aac ex 

V 1 —1 y 
Nu NE MT 
Eo cs 


Xu 
Xoi 
Xsi 


EU 


i 
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! Probiem A-2-17 Show that for the following matrix A: 


A = 
0 1 
there does nct exist any nonsingular matrix T such that 
TAT =I 
Solution The eigenvalues of A are obtained from 
ee 3 | (1—2)! =0 
0 10—2AX 
[^ Thus the two eigenvalues are equal. Suppose that there exists a nonsingular matrix 
T such that 
TAAT =I 
Premultiplying with T 
AT=T 


or 
l. 1 P taj |i fis 
0 1 toy too foi too 
Solving for t;;, we obtain the following: 


T= à un Lie 
0 OQ 
Thus, T is singular. Since T is assumed to be nonsingular, this is a contradiction. 


Therefore, we conclude that there does not exist any nonsingular matrix T such 


that 
THAT =I 


Problem A-2-18 Prove that every eigenvalue A; (i = 1,2,...,7) of 
a unitary matrix (or orthogonal matrix) A has the magnitude of unity. (Hence the 
real eigenvalues of a unitary matrix must be either 1 or —1.) 


Solution Since orthogonal matrices are unitary, we shall prove this 
for a unitary matrix. Notice that there exists a column vector x such that 


Ax = Xx (I5 1 2. essit 
If we take the conjugate transpose of both sides of this equation, then 
x*A* = Ax* 
Therefore, 
x*A*AX = Xx ux 
Since A is a unitary matrix, we have A*A — I. Hence 
X*X = M A;X*X 
or 
(1 — AA gx*x = 0 


Since x*x = 0, we obtain 


or 
|r|? =1 
it follows that the magnitude of every eigenvalue of a unitary matrix is unity. 
Problem A-2-19 Prove that for a normal matrix A 


i . Ax = AX 
implies that 


A*x = Xx 
Solution 


I(A —ADx||? = (A — XDx, (A — ADx) 
= (x, (A — AD*(A — ADx) 
= (x, (A — AD(A — AD*x) 
= ((A — AD)*x, (A — AD*x) 


= ||(A — XD*x ||? 
From this we see that = 
(A —ADx = 0 
implies that 
(A —AN*x = 0 
or 
A*x = Xx 


Problem A-2-20 Prove that the eigenvalues of an n x n Hermitian 
matrix and of ann x n real symmetric matrix are real. Prove also that the eigen 
values ofa skew Hermitian matrix and of a real skew symmetric matrix ar i- 
ther zero or purely imaginary. "ES 


Solution Let us define any eigenvalue of ann x n Hermitian matrix 
A by X = æ + jB. There exists a vector x ~ 0 such that 


Ax — (a +j 

Then adi 

X*A* = (a — j8)x* 
Since A is Hermitian, A* = A. Therefore, 

X*Ax = (a — j8)x*x 
On the other hand, since Ax — (a + j)x, we have 

X*Áx = (a + j&)x*x 
Hence we obtain 


Kæ —78) --(@ -- jB]x*x 2 0 


or 
—2j8x*x = 0 
Since x*x z 0 (for x + 0), we conclude that 
8-0 


This proves that any eigenvalue of an n x n Hermitian matrix A is real.It 
follows that the eigenvalues ofa real symmetric matrix are real,since it is Hermitian. 
To prove the second half of the problem notice that if B is skew Hermitian 


then jB is Hermitian. Hence the eigenvalues of jB are real, which implies that 
the eigenvalues of B are either zero or purely imaginary. 

The eigenvalues of a real skew symmetric matrix are also either zero or purely 
imaginary, since a real skew symmetric matrix is skew Hermitian. (Notice that 
purely imaginary eigenvalues always occur in conjugate pairs in the real skew 
symmetric matrix, since the coefficients of the characteristic equation are real. 
Notice also that an n X n real skew symmetric matrix is singular if n is odd, since 
such a matrix must include at least one zero eigenvalue.) 


Problem A-2-21 Prove that eigenvectors associated with distinct 
eigenvalues of an n x n Hermitian matrix A are orthogonal. 


Solution Let us define n eigenvectors of A by the x;. Then, 
AX; = XX, (Den 1,2...) 
where the A, are the eigenvalues of A. (The à; are real.) Since 
AX; Xj) = (Ai X; X3) = (Ax, xj) = (x; A*xj)) = (x, Ax) 
= (x,, Xxj) = X,(Xo Xj) 
we obtain 
(A; — A) (X xj) = 0 
Since A; ze Xj, we have 
(xi, xj) = 0 


This proves that eigenvectors associated with distinct eigenvalues of an » x n 
Hermitian matrix A are orthogonal. 


Problem A-2-22 Show that any real quadratic form can be repre- 
sented by x’Ax or (x, Ax) or (Ax, x), where A is a real symmetric matrix. (The 
matrix A is called the coefficient matrix of the quadratic form.) 


Solution An arbitrary real quadratic form can be represented by 


n n 
D auxi T2 ayxix 
i=l i<j 


Let us put 
a Ay *** Amn xi 
Qj Qe ttt Aan Xe 
A=]. Y ^ X-|. 
Ain Go ctt Ann Xn 


Then we obtain 
Y auxi +2 > 4,,X,Xj = X'Ax = (x, Ax) = (Ax, x) 
izi i< 


By definition, A is symmetric. In addition, the a,; are real. Hence, A is a real sym- 
metric matrix. (Notice that real quadratic forms can be expressed in terms of rea! 
symmetric matrices and real vectors, whereas Hermitian forms can be expressed 
in terms of Hermitian matrices and complex vectors.) 


100 REVIEW OF MATRICES AND VECTORS 
Problem A-2-23 Show that if A is nonsingular, then A*A is positive 
definite. 
Solution j 
X* A* Ax = ||Ax||* 20 


Since the matrix A is nonsingular, Ax = 0 means that x = 0. Therefore A*A is 
positive definite. 


Problem A-2-24 Show that if a matrix A is positive definite, then 
A”! is also positive definite. 


Solution If A is positive definite, then we can write A = B*B 
for some nonsingular matrix B. Then A-! can be written as A^! = B^!(B-))*. 
This implies that A~! is also positive definite. 


Problem A-2-25 Determine the rank of the following matrices; 


i $e 3 = 
0 1 —5 4 
1 A= 
dd 13 —2 
2 5 1 


3 
2 

1 —1 4 4 
$ 

4 4 29 


Problem A-2-26 Show that if a square matrix A is reduced to the 
identity matrix by a sequence of transformations, then the same sequence of 
operations performed on the identity matrix produces A7!.- 


8 
> 
Il 
N 
N 


Solution 
(1) Rank is 2. 
(2) Rank is 2. 


Solution Let T,,T,, . . ., T, be transformations. If n transformations 
reduce A to I, then 


(T,--. T;T)A =I 
Postmultiplying A~! into both sides of the preceding equation, we obtain 


(Ta: ++ T,T) (AA?) = IA"! 


(T,-.. I; T)I = A"! 


Problem A-2-27 Show that the following matrix A can be reduced 
to the identity matrix by a sequence of elementary operations. 


hm. 


Show also that the same sequence of elementary operations performed on the 
identity matrix produces A-!. 
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Solution A sequence of transformed matrices is shown in the left 
column. The same sequence of elementary operations performed on I is shown in 
the right column. 


fl -—2 1 07 
s=] 1 E l 8 Jj 
r0 —3] 1 —1] 
HE X | 0 1) 
[0 1j =t 4] 
uox | 0 1) 
o n -— 4] 
Ll OJ | 5 3 
[1 01 E $1] xm 
m x L3 Jis 
Problem A-2-28 Given matrices A, P, and Q 
where A =n x m matrix 
P = n X n nonsingular matrix 
Q = m x m nonsingular matrix 
prove that 


rank PAQ = rank PA = rank AQ = rank A 
Solution Notice that 
rank PA < rank A = rank P-\(PA) < rank PA 
This implies that rank PA = rank A. Similarly, rank AQ = rank A. Hence, 
rank PAQ = rank P(AQ) = rank AQ = rank A 
Problem A-2-29: Given two n x n matrices A and B, prove that 
the eigenvalues of AB and those of BA are identical. 


Solution We shall consider first the case where A (or B) is non- 
singular. In this case clearly 


IBA — AI| = [A7 (AB)A — M | = [A7 (AB — ADA| 
— |A71]|AB — AI||A] =|AB — MI 


We shall next consider the case where both A and B are singular. There exist 
n x n nonsingular matrices P and Q such that 


eoir o] 


where I, is the r X r identity matrix and r is the rank of A, r < n. We have 
|BA — AI] = |Q7* BAQ — M| = [Q-! BP-!*PAQ — M| 


"ies ello ay ^ 
Ga G4Jl(0 0 


i E 
where ; Problem B-2-3 Show that if A = A’, then 


Q-!BP-: — [ Gi Gy ] x'B'Ax + x'ABx = 2x'(AB)'x 
Then G5 Go» Note that 
G, 0 (x'ABx)' — x'ABx 
|BA —A1| = | Pa il = a| i Problem B-2-4 Verify that 
aeq. 60 Oa) D 6 i Oi alc a cal 
-| G., DEC. |i P 9-9 PX Imi "ow ru 
= |G), — ALI —al,.,| i 0 10 a9 1 0 O0 OET Ou; 0*0 j| 
Also 0 0 1 aeiLIT O :0 0 1 0 0 jllg © © O] 
|AB — M| = |PABP-! — AI| = |PAQQ-'BP-! — x1 and then show that 
-Ig ee Gi —AI | 9.8 oor BeBe a, 1-00 v 0 0 
9 0J1G, ca] | e erc d m Ui 5 gi 
-|[(8 SEES: S- > , ee 0 als: t 3 | 
E | 00 e. ndis * X 
z mi G, | Sch 25 a, oO © 0 0. 
0 AE = ie 
= |G. — ALI- An] l. viste 
Hence we have proved that | Q O0 - O0 1 
0 0 ==- 1 0 
|BA — MI] = |AB — A1| | Bat =) s a 
PROBLEMS : à i : d 
m^ Problem B-2-1 Show that if A is a nonsingular triangular matrix Problem B-2-5 Prove that for an n X n real matrix A and a real n 
then A`! is also a triangular matrix. : vector x 
Problem B-2-2 Show that the following 4 x 4 determinant: (Ax, Ax) — > $3 0,5x,)* 
i=l j=l 
E d a Prove also that for real n vectors x and y 
TT b, b. b, b, ; « — 
Cr Ce C Cy (x,y)? = A xiyi + 2 Z p Xi XYY; 
di d, d, d, LN n-1n-i 
can be expanded into the following form: (This is called Laplace’s expansion by 2 à oid 2 = "thier 
enas) i Problem B-2-6 Prove that a necessary and sufficient condition that 
TT 4 «|l c| la a||bs A i a real matrix A be orthogonal is that 
| bid, dj [je eld, d, (Ax, Ay) — (x, y) 
4p a | bs by ey b, bj|a, a, 1 for all real vectors x and y. 
d, d.|es c, ĉi cel||dy d, Problem B-2-7 Suppose that A is a transformation from an n- 
= ie ba||as a, $ €1 C2j|az3 a; dimensional vector space V to an n-dimensional vector space V". Explain that if 
d, d.e c, d, d.||b, b, |A| = 1, then A is a rotation or a product of rotations. If | A] = —1, A is a reflec- 
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tion or a product of rotations and one reflection (or odd numbered reflections) 
and, therefore, the orientation is reversed. 


Problem B-2-8 Prove that 
lx + yl? t lx — yl]? = 2x1? +Ily ll?) 


Problem B-2-9 Suppose that vectors X;, Xy -..,Xm are linearly | 
independent and vectors y;, Yə, ..., y, are also linearly independent. Prove that j 
if y; (1 < į < n) can be expressed as a linear combination of x,, Xz, - - - , Xm and if | 
x, (1 <j < m) can be expressed as a linear combination of y;, Yz, .. . , Yn, then m 
must be equal to n. 


Problem B-2-10 Prove that if X is an eigenvalue of an orthogonal 
matrix A, then so is X-!. Similarly, prove that if X is an eigenvalue of an unitary 
matrix B, then so is A7}. ) 


Problem B-2-11 Prove that the characteristic equation of 
Gy Gig Qiz tt Gin 
O às de c on 
A-—|O0 O a -+> ag 
66 6 wx ws, 
is given by 
|A — M| = (aii — A) (a — A) **- (Gan — A) = 0 


(Hence the characteristic equation of a triangular matrix is determined by its 
main diagonal elements.) 


Problem B-2-12 Assume that A is an n X n matrix possessing n 
distinct eigenvalues. Show that the matrix 


A— ^l 
is of rank n — 1, where à, is the kth eigenvalue of A. 
Problem B-2-13 Prove that for n X m matrices A and B 
rank (A + B) < rank A + rank B 


Problem B-2-14 Prove that for an n x m matrix A and m xr 
matrix B 


rank AB + m > rank A + rank B 
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Coordinate Transformation 


3-1 INTRODUCTION 


For a given dynamic system we may define a state vector in a number of 
different ways. A set of the components of a state vector, or a set of state 
variables, is the smallest collection of numbers which must be specified at 
time t = t, in order to be able to predict uniquely the behavior of the system 
for any time ¢ > t, for any input belonging to the given input set in which 
all elements are known for t > /,. Hence the dimension of the state vector 
is the same, however it is defined. The different ways of defining a state vector 
therefore, can be viewed as transformation of the coordinates of the system. 
In order to gain insight into the dynamic behavior of a system, it is 
helpful to make a coordinate transformation so that the coefficient matrix 
of the state vector will be in a canonical form exhibiting directly its eigen- 


values on the main diagonal. Such a canonical form is called the Jordan 
canonical form, 


Jordan canonical form Ann x n matrix A is said to be in the 
Jordan canonical form if 


A, 0 
A, 
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where the A, are m, X m; matrices of the form 


&ox Bre & 4 
B x 0 0 
A=) 5 I o: L3 
Dd: ) 
Bo»4:s9 X 


(Ain A, and that in A, may or may not be the same) and m, + m, +--+ + 
m; = n. In the matrix A each eigenvalue X appears on the diagonal a number 
of times equal to its multiplicity. For example, in à 7 x 7 matrix À, if 
m, = 3, m, = 2, m = 1, m, = | and the eigenvalues of A areA,, A,, X4, Nis 
An Ag, Az then the Jordan canonical form is 


X d 0! 0 
Ü X 1! 

L| 
0 0 Xi 

[24 . d 1 

bf x! 

dances i----; 

| Ag | 

0 EA 


Notice that a diagonal matrix is a special case of the Jordan canonical form. 
Consider a iinear dynamic system 
X = Ax 
where x is a state vector (n vector) and A is an n X n constant matrix. Any 
state vector y defined for this system is related to x by 


x= Py 
where P =n X n nonsingular matrix 


Among a number of state vectors which can be defined for this system a 
convenient one for investigating the system’s dynamic behavior is a particular 
state vector y which yields the system differential equation in the following 
form: 
ý = P-'APy = Jy 

where J = P-'AP is in the simplest possible form, such as the diagonal form 
or more generally the Jordan canonical form. It is always possible to find 
a nonsingular matrix P that will transform an arbitrary square matrix A 
into the Jordan canonical form through the similarity transformation 
J= PAP. 
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The matrix J possesses the same invariant characteristics as A but is 
much easier to manipulate. Hence transforming À into the simplest possible 
form makes it possible to investigate intrinsic properties of the dynamic 
system with the minimum of manipulating labor. 

In this chapter we discuss in detail transformations of the coefficient 
matrix A into the Jordan canonical form, which includes the diagonal form 
as a special case. In Section 3-2 we present preliminary discussions on 
diagonalization of matrices. Specifically, we shall diagonalize the coefficient 
matrix of the vector matrix differential equation with distinct eigenvalues 
into the diagonal form. In Section 3-3 we obtain the necessary and sufficient 
condition that a matrix be similar to a diagonal matrix. Also several thco- 
rems related to the diagonalization of matrices are presented. In Section 
3-4 we treat systems with multiple eigenvectors. We shall obtain transfor- 
mation matrices such that in the transformed equation the coefficient matrix 
is in the Jordan canonical form. Section 3-5 presents a brief discussion of 
machine computation of transformation matrices . Section 3-6 gives a 
summary of transformation matrices that reduce the coefficient matrix of 
the second order system into the Jordan canonical form. In Section 3-7 we 
treat systems described by nth-order scalar differential /equations. We shall 
first transform them into state space equations and then obtain transformation 
matrices which transform the coefficient matrix into the Jordan canonical 
form. Finally, in Section 3-8 we discuss motions of linear free dynamic 
systems. 


3-2 PRELIMINARY STUDY—SYSTEMS WITH 
DISTINCT EIGENVALUES 


Consider the system described by 

X — Ax, x(0)—c 
If we can find a suitable nonsingular matrix P such that the transformation 
X — Py transforms the given system into 


y -P^7APy — y(0)—P-^c—é& 
where 
PUAP = diàg (As, X2) 
then the nth-order vector matrix differential equation (or the corresponding 
n simultaneous first-order differential equations) can be transformed into n 
first-order differential equations of the form 


yi Wa y«(0) = £, = 1,2...) 
which can be solved independently of each other. Such a process is called 
diagonalization of the coefficient matrix A. 
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In Section 3-3, we shall prove that for an n X n matrix A with n distinct 
eigenvectors such diagonalizationis possible if we take the n eigenvectors of 
A as coordinate axes and transform the vector x into vector y. (If A involves 
multiple eigenvectors, then A does not possess 7 linearly independent eigen- 
vectors, and A cannot be similar to a diagonal matrix.) 

In this section we shall present more or less preliminary discussions of 
diagonalization of matrices by use of a system with n distinct eigenvalues. 
(Such a system has z distinct eigenvectors. ) 

Consider the following set of n first-order differentia! equations: 


Ži AyyX, Haaat ee H aina 
Ža dqX, + dasXe + c7 F agnXn (3-1) 
Xn aX, + AnoX, torts + AnnXn 

x(0)— c,  x(0)—0»..., x0) = Cn 


where the a; are constant. Equations (3-1) can be written in the form of a 
vector matrix differential equation as follows: 


X — Ax, x(0)— c (3-2) 
where 
x(t) An 0g ctt Gin 
x(t) Az Qos >` Qan 
x(t)-— * ds A= s . 
X(t) Qn, Ano AUS ann 


Here we assume that A is an n X n constant matrix possessing 7 distinct 


eigenvalues, A., s, .. . , Ane - 
Assume that the solution of Eq. (3-1) or Eq. (3-2) is given by 


(f= ee“ 
xr) 


(3-3) 
x«t) = ee 
Or 
x(t) = ce™ (3-4) 
Substitution of Eq. (3-3) into Eq. (3-1) yields 
(aii — X)€ + arats + ++ + amn = 0 
451€, F (aza — X)€s + * 77 + danCn =0 (3-5) 


Bar F dae o ce =F (ann — A)Cn = 0 


Equations (3-5) have a nontrivial solution if and only if 


à; —X [m e Ayn 
|; JM. LL M E. (3-6) 
Anı n» exem Gy X 
or 
|A—AI|=0 


Equation (3-6) is the characteristic equation, or eigenvalue equation, its roots 
being characteristic roots or eigenvalues. 


Eigenvectors We shall show that if a new coordinate system 
is chosen, each coordinate axis being an eigenvector, the variables in the 
original n simultaneous equations can be separated. 

If ith eigenvalue à; is substituted into Eq. (3-5), then 
(ası — Madey + dis€s + -+ + anen = 0 
2,C, + (aaa — X)€s + +++ + ass = O (3-7) 
Ani Cy St nolo zw gc (Gan ar: Nien =0 
The eigenvalue 2, satisfies Eq. (3-6). For such a case, we cannot determine 


constants like c}, Cx, ..., Cn but only ratios like c,/c,. Referring to the deter- 
minant in the following equation, 


Mal' Harc CS IM 0 (3-8) 


Anı ane PRAE Ann — Ài 


if we define the cofactors of the first row as Ains Ains - - -, Atn and if A, = A, 
is substituted into Eq. (3-8) and the determinant is expanded in terms of the 
first row, then we obtain 


A Ary + lAn + ++ + amA = MA 


If the first row of the determinant in Eq. (3-8) is replaced by the mth row 
where m = 2, 3, ..., n (the value of the determinant is still zero) and the 
resulting determinant is then expanded in terms of the first row, we 
obtain 


amA + amA + +> + AmnAiin = Asim 
(m = 2,3,...,n) 


Thus, for A, we can get altogether n such relationships. Since the same applies 
to any A, we can get a total of n? equations of the following form: 


ay Ain + GA: + coo + AjAns = MAN; (3-9) 
EJ —1,2,3,...,n) 
We shall use Eq. (3-9) later. From this analysis we see tiat 66s os iun 
can be obtained as follows: 


€, = KAn 
€; = KAn (3-10) 
Cn = KA ijn 
where K is a constant. Hence, the x, defined in Eq. (3-3) become 
x(t) = KA,,,e*“ 
- Nu 
X(t) = KAijse (3-11) 


/ X(t) = KA ype 
Equations (3-11) hold for each eigenvalue X,, Az, . . -> Xn The general solu- 
tion of Eq. (3-1) is then given by 
xi(f) = Ki Ame" + K, Aet oo K, Ane 
x(t) = KA ye" + KA aet bie Krida” (3-12) 
x«t) = Khine F K,A,,,6* uo See Khant” 


where the K, are constants to be determined from the initial conditions. If 
the initial conditions are given as 


x,(0) = KA, 
x0) = Kiii (3-13 


x,(0) = KyAiin 
substitution of Eq. (3-13) into the following, which is obtained by letting 
t = 0 in Eq. (3-12): 
x(0) = KiAy + Kedar + +++ + KrAmı 
x0) = K, + KAn + coco Ki Ans 


x,(0) = KiAun + K,Aain + sn n K,Anin 


results in 


K. = K3=-+: = K,=0 
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This means that under such an initial condition the solution is given by 
x(t) = Kid" 
x,(t) = K,A,,.e*" 
of ) 144112 (3-14) 
x«t) = Ke 
Clearly Eq. (3-14) defines the following hyper line (in the n-dimensional 
state space): 


E ea ae a a 3-15) 

Aui as Tice ; 
(Note that if 4,,; = 0, then the corresponding x; is also zero.) Equation 
(3-15) has been obtained for the eigenvalue A,. There are altogether n such 
hyper lines for n distinct eigenvalues A; (i = 1,2, 3,. . . , n). Thus, we get 


E aem EE 3-16 
Ain Aino Ain ( ) 


 —1,2,...,n) 


Such hyper lines are eigenvectors. Eigenvectors are also called principal 
coordinate axes. [Am; is the cofactor of a; of the determinant in Eq. (3-8) 
when A, is substituted for \;.] (Sometimes we may have Ai = Ám = +- 
= Aj, = 0. We shall discuss this special case later.) 

Notice that for real à; (i = 1, 2,..., n) if the initial condition is on the 
eigenvector corresponding to A; then the point representing the system state 
for t > 0 always lies on the same eigenvector. (If A; and 2,,, are complex 
conjugate, however, then motions e*“ and e™=! cannot be separated. The 
combined motion takes place in the plane determined by the two eigenvectors 
corresponding to A, and A; The motion is oscillatory. For detail, see 
Section 3-8.) g 


Coordinate transformation A point in the n-dimensional state 
space can be represented by a vector x. If n eigenvectors of A are chosen as 
a new set of coordinate axes, the same point in the n-dimensional state space 
can be represented by a vector y. Since the eigenvectors are given by Eq. 
(3-16), the relationship between the x coordinates and y coordinates can be 
given by the following equations: 


Xy = Amı HAY + ++: H Anya 
Xa = AysYi + AnreVe + 7 + ÁnisYn (3-17) 
Xn = AynVi + ins + <- + nina 


provided we choose appropriate scales on coordinate axes. Equations (3-17) 
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can be rewritten in the foliowing form: 


x = Py 
where 
Ain Agi D Ani 
fig | ue as rrt An (3-18) 
Ain Aam co Anın 


P is called a diagonalizing transformation matrix or simply transformation 
matrix. If this coordinate transformation is performed, we can show that 
Eq. (3-2) can be written as follows: 

X — Py — APy 


or 
y = P^APy = Dy 


where D is a diagonal matrix. The initial condition terms are transformed into 


the following form: 
y(0) = P-'x(0) 


We shall now show the diagonalization of the coefficient matrix. 
Let us compute AP. By use of Eq. (3-9) AP can be obtained as follows: 


MAin Ns nii "ET Ka aii 
AP = MA MA NT nAn 
MA iin Ao sin iud AnAnin 
Ain Aoi pana Any ^ 0 0 
e Aus An Anis 9 Aa TP x " — PD (3-19) 
Ain Anin zig Anin 0 0 ss An 
where 
x, 8 0 
0 A, 0 
D= : 
0 0 even. Dog 


Since n columas of P are linearly independent, P is nonsingular and P^ 
exists. Therefore 
P-!'AP = D = diag (Ay, Aos - - -> An) 


With the transformation matrix P given by Eq. (3-18), Eq. (3-2) whose 
eigenvalues are distinct (which implies that the eigenvectors are also dis- 


tinct) can be transformed into 


y — Dy 
or 
dr A, 0 o][» 
Vo 0 A» eee 0 ys 
m ps L1 (3-20) 
Vn L 0 0 ais Xs Yn 
This can be written in the following set of n first-order differential equations: 
Ji =A) 
Yo = Mye 
Js = AnYn 
With initial conditions defined by 
y(0) = P-'x(0) 


Equation (3-20) shows that variables Vis Yos ++ + Yn are separated from each 
other. 


The characteristic equation or eigenvalue equation of the system of Eq. 

(3-20) is 

Q4 — AWA, — N::Q4—2)-—0 
There is no change in the eigenvalues. In fact, for any linear nonsingular 
transformation the eigenvalues remain the same. 

Note that, if P is a diagonalizing transformation matrix, then matrix PC 
Where C is a nonsingular diagonal matrix is also a diagonalizing transforma- 
tion matrix, since if 

PAP = D 
then 
C'^"P^"APC— C- 'DC = D 
Hence, P defined by Eq. (3-18) is not the unique diagonalizing transformation 


matrix. Each column of P may be multiplied by a nonzero constant. In 
general, the transformation matrix can be written as 


KA, KsAsy mns KnÁg 
k,A 2 kad, 2 Sag knAn 2 

ia ; 1 à 1 (3-21) 
kK Ayn kyAgyn wae kinia 


where k,, ks, ... , k, are nonzero constants. (Notice that each column repre- 
sents an eigenvector. Eigenvectors determine only directions of the co- 
ordinate axes in the n-dimensional state space.) 


Special case where A;,;; = Ai = +++ = 4,4, =O Sometimes 
it happens that 4,,, = A4, = +--+ = Åna = 0. In this case, we may take 
Aui Ais) --+ > Aun (k # 1) to form the ith eigenvector. [Notice that in 
obtaining Eq. (3-9) we expanded the determinant of Eq. (3-8) about the 
first row. This, however, is not necessary. We can expand the determinant 
about the kth row (k = 1) as well.] Consider the following example: 


X. 0 1 ol[x, 
X,|2|2 —l1 0|| x. 
Xs S 0 — 1 X3 


The three eigenvalues of the coefficient matrix are A, = —2, X, = — 1, 
A; = 1. Let us compute the matrix defined by Eq. (3-18) for this example: 
10 4 
—2 0 4 
—5 0 10 


The elements of the second column are zeros. Hence the inverse of this 
matrix does not exist. Instead of A, Asi Ans, let us take Aas Ans, Aasa 
as the elements of the second column. (The elements of the first and third 
columns remain the same.) Then, 


1 0 4 
P=|-—2 0 4 
—5 —2 10 
and 
st} =, 00 
P'=|0 id o— 
$ vz 0 


The matrix P obtained in this way will transform A into the diagonal form. 
To check, let us compute P^'AP. 


à -$ op 1 OFF 1 0o 4 
P'"AP=|0 $ —41|2 —1 | 2 0 4| 
io ue 0115-00 SHS Se 15] 

—2 00 

=| 0 —1 0 

0 0 ! 


N 
Computation of diagonalizing transformation matrix P (Ap- 
plicable when A has distinct eigenvalues) Note that, in free dynamic systems 
with distinct eigenvalues, once the distinct eigenvalues of the system are 
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known it is not necessary to find the matrix P in order to obtain the diagonal 
form. The diagonal form can be written directly from the knowledge of the 
eigenvalues. In forced dynamic systems, however, we need P-!. (Refer to 
Chapter 4.) In the following we shall discuss two methods for finding P. 

It should be noted that if A has one of the following two forms: 


0 1 0 e 0 0 
| 0 0 Do—ccÓE 
| 0 0 0 0 l 
Lur, ~ai Cia =a, =h 
or 
—a; 0 0 0 
—d, © 1 0: 0 
Ass k i 8 T 
—an-, 00 ... 0 1 
cus Ü U o 078 


then simple formulas are available for obtaining a diagonalizing transfor- 
mation matrix P. [Refer to Eq. (3-37), Eq. (3-50), and Eq. (3-52); and 
Prob. A-3-10 and Prob. A-3-11.] 


Method 1 A diagonalizing transformation matrix P can be 


obtained by computing 4; directly. For example, consider the following 
matrix: 


"n | 4 2 
—3 —1 
The eigenvalues of A are 1 and 2. Matrix P is given by 
P = m Z| 
Are Anis 


In order to find A,,, and A,,, let us substitute A, = 1 in the characteristic 
polynomial. Then we obtain 


jA- al =| ‘ J 
—3 —2 
Hence, 

An = =2, Aus = =(=3) 
To find A,,, and A,,, let us substitute à, = 2 in the characteristic polynomial. 
2 


À — X1| = 
lA—MI-T o. 
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We thus obtain 
Ay, = 3; 219 — -—[-—3) 


We therefore find P as follows: 
[—2 —3] 
— | 3 3 
Then we obtain 
PAP =| 1 || 4 a ae EL d 
—1 —£$ji—-3 —Iljl 3 3 0 2 
Formulas for finding diagonalizing transformation matrices P that 
transform the 2 x 2 constant matrix and 3 x 3 constant matrix into the 


diagonal form, respectively, are given in the following: 
(1) For the following 2 x 2 constant matrix A: 


A a b 
“le d 
where |A — AI| = (A — Ay A — 24); My SE Ag 


a diagonalizing transformation matrix P that transforms A into a diagonai 
matrix is given by 


Pe = Agi ix = A je 
Aur Aa =g —é 
(2) For the following 3 x 3 constant matrix A: 
qub e 
Asid e f 
g mu 
where |A — AT] = —(A — AA — X4) — As) 


and X, A», and A; are different from each other, a diagonalizing transfor- 
mation matrix P that transforms A into a diagonal matrix is given by 


[Ao Ana Ania 
P= Ain Aus Asie 
LAus Ans Asis 
[ |e — P3 e= X, f e — Ns I 
h 2 h i— nr, h i — ^ 
E F t f d f 
= dh i—» s im a" i—3» 
d e€—X d e— d e— 
i h f h l h | 3l 


Method 2 


Let us write 


Du Pre 
Po, Poe 


Pn, Pno 


Pnn 


Since each column of P is an eigenvector of A we have 


where P; is the ith column of the matrix P. To compute the elements of the 


AP, = MP; 


column matrix P,, we solve 


for Pisis s 
For example, consider the same matrix A as in the example of Method 1, 


or 


The eigenvalues of A are 1 and 2. To obtain the diagonalizing transformation 


(A — 
> Pris 


XDP;-0 (i= 


mE 


matrix P, we solve the two equations 


(A — DP, — 0 


which can be written as 


E 


These two matrix equations yield the following two scalar equations: 


3p, + 2p =0 
Pis Poo = O 
Since we have four unknowns and two equations, we can choose two un- 
knowns arbitrarily. Let us choose p,, = 1 and p,, = 1. We then obtain 
Pa = —3, Pa = —1 
Thus 
Ply -i 
—$ mm 1 
and 
—2 —2 4 2 1 ] 1. 0 
PAP = = 
[a “ala -il-g Al 3] 


pairs 


ie 


System whose distinct eigenvalues involve complex conjugate 
Assume for convenience that the system involves only one pair of 


and  (A—2DP,—0 
0 2 2] | Pre 
| 0 | ENS | E ia 


i 


0 
0 


| 


complex conjugate eigenvalues. This, however, does not limit the usefulness 

of the following discussion in any way, since extension to the case where 

there are more than one pair of complex conjugate eigenvalues is obvious. 
Assume that the eigenvalues A, and À, are complex conjugates and are 


given by 


Assume also that the eigenvalues ^z, A,, . . 


À =o T jo, 


A, =o—jo 


. , Àn are real and distinct. We can 


transform Eq. (3-2) into the form given by Eq. (3-20) or 


If the second transformation is made by the following equations: 


or 


then we obtain 


» =(¢+jo)y, 

Jo = (e — je)ys 

Vs = Ass 
SUL» 


1 5 
n»-32n-435 
-— S A 
Jat oT 2? 

H 25 
Yn = Zn 
ES dL 
rm ie 
por 
-w c" 
0 O 1 
0 0 0 
c 0 
—o g 0 
0 0 X 
0 0 0 
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or 
Zz, = 02, + 92; 
Z, = —@Z, 4.02, 
24 = NsZs 
Zn = AnZn 


It can be seen that z, and z, are dependent on each other, although z, (or 
Za); 23, Z4... , Zn are independent of each other. 
If two pairs of conjugate eigenvalues appear in the characteristic equation, 
then the system equation can be given as follows: 
d Zi 


ES [ Ci O; 


A —0, ©; 


re NV : 0 . An H 


This completes our preliminary discussions on diagonalization of coef- 
ficient matrices in state space equations which are vector matrix differential 
equations. In Section 3-3 we discuss details of diagonalization of various 
types of matrices similar to diagonal matrices. 


3-3 DIAGONALIZATION OF MATRICES 


In this section we present the necessary and sufficient condition that a given 

matrix is similar to a diagonal matrix. We shall also present associated 

subjects on diagonalization of matrices in the form of theorems and proofs. 
We shall first prove the following basic theorem: 


Theorem 3-1 An n x n matrix A is similar to a diagonal 
matrix if and only if a set of n eigenvectors is linearly independent. 


Proof Suppose that A has z linearly independent eigenvectors 
Pu Boross Pi LEU Assess, be the corresponding eigenvalues. Let us 
define 
P —[P,iP,i -++ iP.) 
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Since 


we have 
AP — PD 


where 
D = diaz (Ais Ns 25 An) 
Since n column vectors P,, P,,..., P, of P are linearly independent, P is 
nonsingular, and therefore P^! exists. Hence we obtain 
PAP = D 
It follows that if n eigenvectors are linearly independent, A is similar to a 


diagonal matrix. 
Conversely, if A is similar to a diagonal matrix, then there exists a 


nonsingular matrix Q such that 
QAQ =D or AQ = QD 


Let 
Q = [Q Qi +++: Qu] 
Then 
A [Q; i Qo! +++ | Qn] = [Qi Q: i Qh] D 
or 


AQ, XQ  (—1L2,4..,)) 


This implies that the Q, are n eigenvectors of A. Since Q,, Q,, ..., Q, are 
column vectors of a nonsingular matrix Q, they are linearly independent. 
Thus we have proved that A is similar to a diagonal matrix if and only if n 
eigenvectors are linearly independent. 

Notice that if an n x n matrix A having 7 distinct eigenvalues X, Aa, ..., 
X4 is similar to a diagonal matrix D, then the diagonal elements of D are the 
eigenvalues of A. Hence the diagonal matrix D is unique, except the order of 
the diagonal elements A,, 45, . .. 5 As. 

We shall next prove the following theorem: 


Theorem 3-2 An nx n matrix A possessing n distinct 
eigenvalues has z linearly independent eigenvectors. 


Proof Let n eigenvalues of A be 14, A», ..., An and corre- 
sponding eigenvectors be P,, P,, . . . , Pa, respectively. Then we have 


AP; = XP, (Pe 143,4) 
Consider the equation 
cp, + GP; + Ht CP = 0 


If n eigenvectors P,, P,, . . . , P, are linearly independent, then c,, Cs, ..., c, 
must be equal to zero. Multiplying 


(A — XD <- (A — M-a IAA — ALD --- (A — XD) 
into the foregoing equation we obtain 
(A — AD) --- (A — X IA — MaI) --- (A — 4D 
(CP, + CPs + o Py + +--+ + 6 P,) 
=(A—AD---(A— N- D(A — aD --- (A — A4Dc;P, 
= (Ay — Ay) +++ (Ar —42)04 — Aaa) ++ Qu — An) P: = 0 (3-22) 
where we have used the fact that 
(A — AJA — MI) = (A — MAIA — AD) 
and 
(A — XjDc;P, = Ac;P, — XjlcP, 
= ACEP, — X;c,P, 
= Oy — Xj)eP. 
—-0 (kj 
#0 (k Æj) 
Since the A; are distinct, Eq. (3-22) implies that c; = 0 (i = 1, 2,..., n). 
Thus, the n eigenvectors corresponding to n distinct eigenvalues are linearly 


independent. This completes the proof. 
The following theorem follows immediately from Theorems 3-1 and 3-2: 


Theorem 3-3 An n x n matrix A with distinct eigenvalues is 

similar to a diagonal matrix. 

From Theorems 3-1, 3-2, and 3-3, we deduce that if an n x n matrix 
A is to be similar to a diagonal matrix, then for each multiple eigenvalue A, of 
multiplicity m; there must exist m; linearly independent eigenvectors cor- 
responding to Aj. 

We shall next prove that any n X n matrix can be transformed into a 
triangular matrix. ' 


Theorem 3-4 An n x n matrix A, whose eigenvalues A;, 
As, ++, Àn are not necessarily distinct, can always be reduced to a triangular 
form, or 


Z-'AZ = 


where Z is a nonsingular matrix and the + denote elements not necessarily 


zero. 


Proof We shall first prove that there exists a nonsingular 
matrix X, such that 


I 


where A, = (n — 1) x (n — 1) matrix 
a, = 1 x (n — 1) matrix 
0 = matrix consisting of zeros 


Let us denote the nonzero solution of 
Ax = XX 

as x,. (Such solution x, exists, since |A — à, I| = 0.) Let us define X, asa 
nonsingular matrix having x, as its first column. Let us denote n columns of 
AX. as P, By oss Pus OF 

AX, = [Pii Pei --- i Pal 
and n columns of X;'AX, as Q,, Qs, ..., Qn, OF 

X;'AX, = [Q. iQ. | - -+ i Qu] 

Then clearly 


P, = AX; — XX, 
Since 
An Agi ee An 
1 Aj Aso cct Ans 
Nim ot A . 
Se MAI 
Ayn Aon s Ans 
where A,; is the cofactor of the element x,;, we obtain 
^ t AX; z [X| E 
j=1 
^ £ AjaXjı 0 0 | 
1 j=l - 1 xem | 
Q =X] X] 
Ki 2 AjnXj 0 |? | 


Thus, we have shown that the matrix X;'AX,, where X, has x, as its first 
column, will have A, as the first element of the first column and zeros in the 


rest of the first column. 
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The matrix A, has n — 1 eigenvalues, As, Ns, .. . , An. Let us define the 
solution of A,y = X,y as y,. By choosing y, as the first column of a non- 
singular matrix Y, we have 


"8 ld 
where A, = (n — 2) X (n — 2) matrix 
a, — ] X (n — 2) matrix 
Notice that by choosing X, as 
cE iO 
ao RE 
0: YJ 
we obtain 
1:0)" fàr i a, ] [110 
X;(Xr'AX)X, —|--L--| |--- iI--||---- 
0:Y 0 A,/10; Y 
Ari ay A t | * | 
| fe n FOR CAN =|0 A! 
ras | p 
0 lA, 


where the « denote elements not necessarily zero. Repeating the same process, 
we finally obtain 


À| * * 
Ne 
ZAZ = 
l * 
0 A 
where Z = KN oe Ks 


This completes the proof. 
Notice that Theorem 3-4 is useful in solving vector matrix differential 
equations. Consider the vector matrix differential equation 


X = Ax, x(0) = x, 
This equation can be transformed, by use of the transformation x = Zy, into 
y=Z'AZy,  y(0)= Z'x(0) 
Since Z^'AZ is a triangular matrix this last differential equation can be 
solved for y by solving first the scalar differential equation in y, and then 
substituting the solution y, into the scalar differential equation in y,_,, then 
substituting the solutions y, and y„-, into the scalar differential equation in 


Yn». Repeating this process, Vn, Y4.,, ..., Yı can be obtained. By the trans- 
formation x — Zy, we can obtain the solution x. 
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Unitarily similar matrices Ann X 7 matrix Bis called uni- 
tarily similar to an n X n matrix A if there exists a unitary matrix U such 


that B = U-'AU. 


Theorem 3-5 An n x n matrix A over the field of complex 
numbers is unitarily similar to a diagonal matrix if and only if it is normal, or 


AA* = A*A 


Proof Suppose that A is normal. Let A4, Xs +--+ > An be the 
eigenvalues of A and Xj, X:,--+> Xn be normalized eigenvectors associated 
with A4, Aa, -- -> Ans respectively. Let U, be a unitary matrix. If the first 
column of U is chosen to be x, then refering to Theorem 3-4 the first 
column of U;'AU, becomes equal to 


^ 
0 
0 
0 
Hence we may write 
Ay | a 
U;'AU, = | ------ 
01A, 
where A, — (n — 1) x (n — 1) normal matrix whose 
eigenvalues are Xe, Ag. ++ +> Xn 


a, = 1 x (n — 1) matrix 
9 — matrix consisting of zeros 
Since 
(U;'AU,XU;'AU,)* = UFAU,UFA*U, 
= UfAA*U, = UFA*AU, 
= U*A*(Uz')*U;'AU, = (Ui'AU))*(U; 'AU,;) 
U;' AU, is normal. Hence 


Mia, k 10] fh} 0] E 
: | A, Lat | At BAN i 


i is . 


or 


| 


it follows that a, must be equal to Gro, or a, = 0. Hence U AU 
zer > can e 
1 1 1 1 b 


le A, ON (n—1)x(n— 1) normal matrix whose eigenvalues are 
Puas 3, o + e 5 An. 
By repeating the same process, we c 
i 3 an find an (n — 1) — i 
matrix V such that i ULM des 


where A, is an (n — 2) x (n — 2) nor i i 
5 mal matrix whose eigenva 
Nis Nyy ..., Aa. By letting : 2s 


we obtain 
li 0 Ay d. i 
U; (U;'AU)U, = E as | EN A 
0! v-jl0 ! A, lO: v 
A, OF 
Ax f. od : ! 
= K RN | =|0 A, 2 
0 | VAV] | -=----- pec 
. A: 
Repeating the same process, we can finally obtain 
ry 07 
U-'AU = U*AU = i 
0 Xn 
where U-UU -U 


Hence we have proved that A is unitarily similar to a diagonal matrix if A 
1s normal. 


We shall next prove that if A is unitarily simi i 
! itarily similar to a dia i 
then A is normal. Let ETE 


U^'AU = D = diag (\,, Naya ee An) 
where U is unitary. Then, 


A = UDU"! 


and 
AA* = (UDU^)(UDU^?)* = UDU`'UD*U* 
= UDD*U* = UD*DU* 
= (U-')*D*U*UDU"' = (UDU"')*(UDU"') 
= A*A 
Thus we have proved that A is normal. Hence the theorem. 
From Theorem 3-5 we deduce that Hermitian matrices, skew Hermitian 


matrices, unitary matrices, and orthogonal matrices are unitarily similar to 
diagonal matrices. Hence these matrices can be diagonalized. 


Example 3-1 Diagonalize the coefficient matrix of the following 
state space equation: 


X = Ax 
where 
x, 1 1-j4/2 0 
x—-|xX, A=|1—j/2 2 iv 3 
x 0 =i s AN 


Notice that A is Hermitian; therefore it can be diagonalized by use of the trans- 
formation matrix whose columns consist of the normalized eigenvectors of A. 
The eigenvalues of A are found to be 
AX, =l, à = —]l, As =4 
For X, = 1, the eigenvector is obtained from solving (A — A, DP, = 0 for P,. 
Namely the eigenvector P, is determined from 


(1 +jV7 2)p3 —0 
(1 —j4/ 2)Pu + Pa tiv 3p —0 


—j4 3 p =9 
it follows that 
[Pu | =i 3a 
P, = | Pa | = 0 5 a = nonzero constant 
LPs1 L(1 — j~ 2)a 
Similarly for A, = —1, we obtain 
[Piz [a cj 2)b 
P, = | Poo| = —2b 5 b = nonzero constant 
LPs2 L —35 


and for X, = 4 we obtain 
Dis (1 Iv 2)c 
P; = | P | = 3c , 


P33 iv 3c 


= nonzero constant 
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—j/3a  (-4j42)b (j42)c 
P = [P i P,P] = 0 | O—2b 3c 
0—j42)a —-j/36b —jM/3 c 
By normalizing each column vector, we obtain the following unitary matrix U: 


DI IEI LEJ] 
A/ 6 A/10 Sis 
U-| o0 = 3 
4i — X5 
1—-j4/2  -j4/3  -i/3 
V6 A/10 A/15 


The unitary transformation x — Uy will transform x — Ax into y — Dy where D 
is a diagonal matrix, or 


y= U-AUy = U* AUy = Dy 


In fact, 
> iJ 3 DE ! 
^ /6 0 ——— p 14j/2 0 
yy |= 1 —j/2 —2 iv 3 


—jV/3 1-j4/2 l1-c4j4/2 
Was A/10 A/15 ^ 


: 0 = ES 
10 WAE Ja 
[edF SE ait 
[6 A/10 A 15 Ys 


LO 0 4jL» 
The initial condition is transformed into 
y(0) = U-!x(0) = U*x(0) 

With trivial modifications to Theorem 3-5 we can prove that real sym- 
metric matrices are orthogonally similar to diagonal matrices. (The proof is 
given in Prob. A-3-3.) In the following we prove that if A is real and 
is orthogonally similar to a diagonal matrix, then A is symmetric. In other 


words if A is over the field of real numbers, then only real symmetric matrices 
are orthogonally similar to diagonal matrices. 
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Theorem 3-6 Ann X n matrix A over the field of real num- 


bers is orthogonally similar to a diagonal matrix if and only if it is sym- 
metric. 


Proof The first part (sufficient condition) of the proof is 
quite similar to the corresponding part of Theorem 3-5. It is also proved in 
Prob. A-3-3. Hence, here we shall prove only that if an n X n real matrix 
A is orthogonally similar to a diagonal matrix, then A is symmetric. 

If A is orthogonally similar to a diagonal matrix, then there exists an 
orthogonal matrix T such that 
TAT = D 
Since T is orthogonal, T7! = T' and we obtain 
A = TDT = TDT' =A’ 


Hence A is symmetric. 


Congruent matrices Forn x n matrices A, B,and P, where P 
is nonsingular, if the following relationship exists 


PAP—B or A-PBP'-(P)B(P) 
then A and B are said to be congruent. 


Example 3-2 Obtain a diagonal matrix similar to the following real 
symmetric matrix A: 


1 —2 2 
A=| —2 3 0 
2 03 


Notice that such a real symmetric matrix A can be diagonalized by the use of the 
transformation matrix whose columns consist of the normalized eigenvectors of A. 
These normalized eigenvectors are linearly independent and are orthonormal to 
each other. 
The eigenvalues of A are obtained as 
hi = =], A, cm 3; Ag 5 
The eigenvectors corresponding to Aj, Xs, and A; are obtained from 
(A — X,DP, =0 (ies 1,23) 
Namely, for A, 
2p — 2pn + 2p3 = 0 
—2pui + Apa = 0 
2p + 4p = 9 


from which p, Pan and ps; are obtained as 


Pu —2a 


P, =| pa |=| —al, a = nonzero constant 
P31 a 
Similarly we obtain eigenvectors corresponding to A, and A, as follows: 
0 
P, =| 5 |, = nonzero constant 
b 
c 
P; =| —e], ¢ = nonzero constant 
c 
Hence 
—2a 0 c] 
P= [P; P, PJ=| —a b —c 
a b "i 


Normalizing each column vector of P we obtain an orthogonal matrix T that will 
transform A into the diagonal form as follows: 


-z 9 73 
Bw [oe 
E E WX 
1 1 1 
Ve 3 Ma 
In fact, 
T“AT = TAT 
z : - ag ee l 
ve So JE e t- 5 
- 0 or yz |o|l|-Ze Z3 -73 
T -4--3-|230 BR 1 
V3 X3 SS. dL FS VE. 34 
ay & ð 
=| 030 
005 


(For the given A in this example we need not use the orthogonal matrix T to trans- 
form A into the diagonal form. The transformation matrix P defined by Eq. (3-18) 
will transform A into the same diagonal form, since the given A involves only 
distinct eigenvalues. If a real symmetric matrix A involves multiple eigenvalues, 
then we may use Method 2 on page 118 or we may use an orthogonal matrix 
T for transforming A into the diagonal form. See the following example.) 


Example 3-3 Show that the following real symmetric matrix A is 
orthogonally similar to a diagonal matrix: 


2 1 —1 
A= 1 2 —1 
—] =! 2 


The eigenvalues of A are found to be 
X, = 1, As, A. = 4 
Eigenvectors corresponding to A, = A, = 1 are obtained by solving 
(A —A,DP, — 0 
or 
Pu + Px — Ps: = 0 
Pu + Pa — Ps. = 9 
—pu — Pa + Ps, = 0 


Notice that these three-equations are identical; hence we have three unknowns 
and one equation: This equation represents the eigenvectors corresponding to 
À, = X, = 1. We may arbitrarily choose p;; = 0, p; = a. Then p; = a. Thus 


Pu 0 
P, =| pn |=} a], a = nonzero constant 
P31 a 


The second eigenvector P, is obtained so that it is orthogonal to P, and satisfies 
the equation 


Pio + Po — P = 0 
From the orthogonality condition 


(P,, Ps) = 0 
we obtain 
Pz + pig = 0 
Hence we obtain P, as 
Pre | 2b 
P, = ze] =| —b|, b = nonzero constant 
P32 b 


For the eigenvalue A, = 4, we obtain the eigenvector P, as 


Piz c 
P, =| pa, |=| c|, c= nonzero constant 
Pa =ç. 
Hence 
0 2b c 
P = [P, P, P,])=|a —b c 
La b —c 
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Normalizing the column vectors of P we obtain the following orthogonal matrix T: 


1 1 1 

Lx 2 a 6 v 3 

The orthogonal matrix T thus obtained will transform A into the diagonal form. 
In fact, 


TAT = T'AT 


= i 
0 > ra 2 mara va Z3 
a a l 1 1 l | 
VE "we e| ! AHN A VT 
L. ed 1 1 | 
T ED S3 A3 -Z1 =L 2 JT ve we 
[1 0 03 
=!0 1 0 
Lo o 4j 


3-4 SYSTEMS WITH MULTIPLE EIGENVECTORS— 
REDUCTION OF COEFFICIENT MATRICES INTO 
THE JORDAN CANONICAL FORM (3-/)! 


Consider the following five matrices: 


[4 1 0 0 hou % ux d 
0x 1 0 ô X f à 0X0 0 
ES WU wy 0 0 0J 0 0X 1 
LO 0 0 A, 0 0 *" x Ü 4 8 x 
M 1I 0 © x 8 9 € 
0200 0 0 0 
0 0 x OJ 0 0 X, 0 
Ü 8B 6 x 0 0 0 x, 


Each of an five preceding matrices has the same characteristic equation 
(A — X4)* = 0. The first one corresponds to the case where there exists only 


‘See Reference at end of chapter. 
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one linearly independent eigenvector, since by denoting the first matrix by 
A and solving the following equation: 


(A — Dx = 0 


we obtain only one eigenvector 


a = nonzero constant 


The second, third, fourth, and fifth matrices have, respectively, two, two, 
three, and four linearly independent eigenvectors. (Notice that only the 
diagonal matrix has four linearly independent eigenvectors.) 

Ifann x n matrix A does not possess n linearly independent eigenvectors, 
then A is not similar to a diagonal matrix. It can be proved rigorously that 
an n X n matrix A possessing less than n linearly independent eigenvectors 
is similar to the Jordan canonical form, where the elements on the main 
diagonal are the eigenvalues and the elements immediately above the main 
diagonal are either 1 or 0 and all other elements are zero. We shall not prove 
this statement in this book,! but shall use the result. 

In this section we shall focus our attention on the matrix in which the 
multiple eigenvalue of multiplicity m yields a single eigenvector of multi- 
plicity m. We shall obtain transformation matrices that transform matrices 
with such multiple eigenvectors into the Jordan canonical form. For example, 
if A involves eigenvalues A,, Aj, Ai Ay, and As, we shall find a transformation 
matrix P such that 


fa, x v 6 "0 
0X 1 0 6 
P"AP-|O 0 X 0 0 
00 020 


0 0 00 X 


Notice that if we merely follow the method of transformation discussed in 
Section 3-2 for systems with distinct eigenvalues, then a difficulty arises 
because the matrix P given by Eq. (3-18) becomes singular and P^! does not 
exist. Therefore P matrix must be modified. 

We shall use a third-order system as an illustrative example and then 
extend the results obtained to more general cases. Consider a third-order 
system 


1Proofs of this statement may be found in a number of books on matrices. See for 
example (3-2). 


X — Ax (3-23) 
or 
X a b el 
Xs = d e f Xs 
X; gd ism 


If the multiple eigenvectors are double eigenvectors, then we may first assume 
that the corresponding double eigenvalues are slightly different from each 
other and write eigenvalues of A as An A, + AA, X4. We then construct a 
transformation matrix and let AX, — 0. If we denote a transformation 
matrix defined by Eq. (3-18) as P, then 


Au Aou Asi 
P= Ari Asis Aus 
Hus di Ans 
By letting X, = X, + AX, we obtain 
e— À y e—r,—Ar, f e—2AX f 
h i—AX h i—AX;— AX, h i-d, 
"AL ee e 
g i—Xx & i-r,—- Ad, 3 i= 
a e— À E fe IPC] d e— 
g h g h j h | 
By the transformation x — Py, we obtain 
Jy; Ay 0 0]f y 
J:i =10 X,4-AXM 0 ys 
Ds 0 0 As} Lys 
If we perform the second transformation defined by y = Qz, where 
0 -7x 0 
qe a AX 0 
0 0 1 
the system equation becomes 
2, ÀidT AM, 1 OgJ][z, 


2.) = 0 X; ojja (3-24) 


Now let AX approach zero, then Eq. (3-24) becomes: 


2 A, do TZ 
2,12 400: Ry Oeil a (3-25) 
Ža 0 0 Xz 


Equation (3-25) is the simplest form we can get when the system of Eq. 
(3-23) involves double eigenvectors. (It is noted that it is impossible to 


separate the variables z, from z,.) 
The transformation matrix S that transforms Eq. (3-23) into Eq. (3-25) 


.can be obtained as follows: 


S — lim PQ 
A, =o 
e—X—AX, f > dL e—A. fJ) 
n fcd s 2x43 AX, (i T e) h hes i, | 
= i jd f dos | 
o E T —— , EE i— À| | 
d e—ħ— A, d e— 
g h g g h | 
e— À if iis e— À; F 
h de] 99 DR 
"m fF d f " 
g i-^ ji g^ i— ^ R 
~ "2 E e—A; 
g h " g h 


Notice that the elements in the second column are derivatives (with respect 
to X,) of the corresponding elements in the first column. (Note that matrix 
S can always be made nonsingular. See Example 3-4 and Prob. A-3-5.) 

In the foregoing we have considered the case where the eigenvectors of 
A involve double eigenvectors. We shall next consider the case where three 
eigenvectors of A are all equal, or A has only one linearly independent 
eigenvector. In this case the three eigenvalues are equal, or X, = A, = As. 
In such case we can proceed exactly as we have done for the double eigen- 
vector system. We first let A, = X, + AX,, A; = X, + 2A), and define P as 


follows: 


e— Ài f e—r,—AA, f e—r,—2AA, E 
h i, h I—N Any h 1-312294 
dab T NE Tr EL | 
- Cl Bd E i-r; k 6-5] 
d e—X e e ee E e—r,— 2A] 
g h g h g h | 
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Then. the transformation x — Py will transform the system equation into 


[5 Ay 0 0 A 
P1—-[0 X AX 0 ve 
Ps 0 0 A, + 2AX, J Ls 
If we use the second transformation y — Qz, where 
fe l 
99 — Xu 
1 1 
= 0 ———— — -—— 
Q AX, AX 
JJ" CEDE S 
AX, 2Ar} 
then we obtain 
2; A, + 2A), 1 0 Ži 
i| 0 A+ AA 1 A (3-27) 
Zs 0 0 AJ L2s 
Now let AX, approach zero, then Eq. (3-27) becomes: 
Zi mM T UT 
Zí[-|0 A, 1 Z, (3-28) 
23 LO (0 Xm) Ley 


The transformation matrix S that transforms Eq. (3-23) into Eq. (3-28), 


the simplest form for the system with Ai =A. = As, can be obtained as 
follows: 


S — lim PQ 
àA, 0 
[le — X, — 2A), f 
h santa | 2X, + 2ArA,—-(i +e) 1 
= lim |J . f d 0 
odii E I — Ñi — 2A), 
| : dian 
g h 8 0 
[]e —À, T : 
h i — À dg 9 1 
d f 
"|u| 09 € e 
j A 
L. JE h d " 
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Notice that in Eq. (3-29) the first column consists of A,,;, the second column 
is the first derivative (with respect to A4) of the first column, and the third 
column is the second derivative of the first column divided by two. 

The preceding analysis can be extended to general cases. If the char- 
acteristic equation involves, for example, seven eigenvalues (A,, X. 14, Ais Ass 
As, A;) but has only three linearly independent eigenvectors, then the trans- 
formation matrix S that transforms the original coefficient matrix A into the 
Jordan canonical form can be given as follows: 


, 1 "n " 1 
Es Ain 2T Lil d Asi, Agi Ar, 
A A l yn Lyin A A: A 
112 112 2r 112 3T 112 512 512 712 
Aus Aha Aba DAM, Any Abs A 
113 113 Zu 113 VET: 113 513 513 713 
t 1 n "n + 
Si=j Aiu, Aa SpA spale Aiu Ai Ar (3-30) 
Has, Mi Ae dali dus din d 
115 115 2r 115 3T 115 $15 515 715 
A wat lyr Lym A A A 
116 116 2r 116 3r 116 516 516 716 
|4 A! ] 4" l y A Al A 
117 117 ain 117 3r 117 517 517 717 
where primes denote derivatives with respect to X;, for example, 
m 20 
Aw = dxj A09) 
d? 
RS E A: 
113 dn ust 1) 


Asis = Fe Assis) 


If the transformation x = Sz is carried out, then the transformed equation 
will have the following Jordan canonical form: 


Ži ^; 1 0 0! 0 e 
2 0 um £ 01 Z; 
Z3 0 8 x 4 Z3 
4| [0.0 0 Xi Z4 
| ~ RR---—---------- — —' 
E^ ih i1 Z; 
Że | 0 X | Ze 
box 0 i X: Z: 


Similarly, for the system having m identical eigenvalues A, =A, = -+ 
= Am, for which there exists only one linearly independent eigenvector, the 
first column of S consists of A,,;, the second column consists of the first 
derivative (with respect to X,) of the first column, and the ith column con- 
sists of (i — 1)th derivative of the first column divided by (i — 1)!, (i = 2, 
3,..., m). If, in numerical computation, all the elements of the first column 
turn out to be zero, then we substitute A; (A z 1; j= 1,2,...,7) for the 
first column, and the ith column is then the (i — 1)th derivative of the first 
column divided by (i — 1)!, (i = 2, 3, ..., m). Similarly, if all the elements 
Ap; (J= 1,2, ..., n) of the pth column (corresponding to the eigenvalue 
Àp) turn out to be zero, then we substitute 4,,; (k ~ 1) for the pth column. 

Note that such a transformation matrix derived in this section is not 
unique for a given coefficient matrix A. Suppose that an n x n matrix S is 
such that S^! AS = J, where J is a matrix in the Jordan canonical form. If 
a nonsingular matrix T is equal to SV where V is nonsingular and satisfies 
JV = VJ; then T^! AT = J. Clearly, since T equals SV, we have 


TV = and = VT-'ATV-! = 
or 
TAT = VV = V1VJ = J 


We thus see that matrix S is not unique. Consider as an example the following 
A: 


2 1 3 
A= 4 1 2 
-4 2 
The characteristic polynominal is 
|A — A1| = —QA —2A—2A—) 
A transformation matrix S can be obtained from Eq. (3-26) as follows: 
.20 ıı _20 
13 13 
48 100 
S = =5 4 ar 
16 24 40 


13 13 13 
This matrix S will transform A into the Jordan canonical form, orS^! AS = J, 
where 
M 1 0 
J=|0 X 0 Q=2, m=) 
0 O A; 


The matrix V which satisfies JV = VJ will have the following form: 


v, 9 0 
V—|0 v O0 
0 0 * 


Here v,, v, and v, are arbitrary constants satisfying v, v, # 0. If we choose, 


for example, 
= 3.13 = 5 
u= —i Vg = —2'43 V3 Ü 


Dj 


then the following matrix T 


5 1 
12 348 * 
T=SV=| 1 & -3 
—à 0 1 


will transform A into the same Jordan canonical form, or 
T'AT—J 
Example 3-4 Consider the following matrix A: 
5.40 
=| 0 1 0 
—4 4 1 
The eigenvalues of A are obtained as 
Am l, A, — l, As =5 
For the eigenvalue A; = 5, the corresponding eigenvector is obtained by solving 


(A — 5Dx, = 0 
for x,, or solving 
, 4xs, =0 
—4x3, — 0 


—4xi + 4x23 — 4x; = 0 


for Xis, Xas, and x4,. The result is 


X13 —r 
Xa | = Ql; a — nonzero constant 
Xss a 


For the multiple eigenvalue A; = Xs = 1, the corresponding eigenvectors may 


be obtained by solving 
(A — Dx, = 0 
or 
4x1, + 4x31 =0 
0x, + Oxo, + 0x4; = 0 
—4x1, + 4x, = 0 
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from which we obtain only one linearly independent eigenvector 


E 1] 07] 

H H 
EZA | =| 0 | b = nonzero constant 
X31 b 


This means that the given matrix A possesses only two linearly independent eigen- 
vectors. Hence A can not be diagonalized. The simplest form we can obtain is the 
Jordan canonical form: 


1 1 0] 
J=/0 1 d 
0 OQ 5 
Let us obtain by use of Eq. (3-26) a transformation matrix S such that 
S-!'AS — J 
Notice that the first column of S given by Eq. (3-26) becomes zero, or 
e— À F 0 0 0 
h i —A 4 0 
fe Lal EEREIS 
Zg i—XAX —4 ojll 
E e— " 0 0 
g h —4 al 0 
Hence we modify Eq. (3-26) as follows: 
d 
Aw (X) dx, 41211) A31 Q4) 
d 
S =| Ais (Ay) dx, 41221) Ais (83) 
d 
Atos (4) dx, 4123) Asis (83) 
[ -| ] c | b e — à ff 
h i—X h i— ^ 
» a— Ài c 3 š d f 
ss en AFO dro». 
a—-n, b d e—Xx 
Ie g h k E h | 
= 40 —4 0 
| 4 0 s 4 — | 
0 4 
=) 4 0 = 0 0l fe p 
a -|_° *- EST ZEE 
| 44 : "de —32 4 —16 
L |—4 4 —4 4l 
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S^! is obtained as 


<tr =h =A] 
S7 = 0 -} 0 
& 4 od 
and 
—4 -h —3 5.40 0 4 16 1 dq Q 
SAS = 0 —i 0 0: I 0 0 —4 |= |o 1 0 
is di 0 —4 4 ] —32 4 —16) 00 5 


3-5 MACHINE COMPUTATION OF 
TRANSFORMATION MATRICES! 


The hand computation of the transformation matrix P or S discussed in 
Sections 3-2, 3-3, and3-4 for an n x n matrix A becomes laborious if n = 4 
or 5 and becomes impractical if n > 5. For n > 5 machine computation may 
become a necessity. 

This section presents the principle of machine computation of the 
transformation matrix. The process of computing the transformation matrix 
starts with computation of the eigenvalues of A. This can be done by a 
standard technique (3-6). Hence we assume that the eigenvalues of A are 
already found. 

We shall first consider the case where the Jordan canonical form of A is 
diagonal. Let us denote the transformation matrix by P, or 


P = [P,1P.i--- i Pal 
where P,, P, ..., P, are as yet unknown eigenvectors. To determine 
eigenvectors P,, we successively solve the equation: 
(A — ADP, = 0 
Suppose that the eigenvalues of A are 
Ni Ai s» es Ais Nl Am+2 sese $ Nn 
where Xi, Asi, Ams, <- - s Àn are different from each other. (Extension to 
other cases will become obvious.) If the m eigenvectors corresponding to 
m equal eigenvalues A, are linearly independent, then the Jordan canonical 
form of A is diagonal. The eigenvectors P,, P.,..., P,, are found by per- 
forming elementary operations on A — AI until the last m rows of the 
matrix are identically zero. This is possible since the rank of A — ^I is 


n — m. We shall define the resulting matrix with m rows consisting of 
identically zero elements as R;. Then R; in a reduced form can be written as 


'This section closely follows Reference (3-5). 


Tim 
0 1 Ü fa Yos 
R, = 0 0 l Fn-m,1 Fn-m,m 
00. 0 0 0 
0 0 .. Ø 0 T 0 
The m eigenvectors corresponding to A, are 
Tu Fim 
LT Tom 
Fn-m 
Boe] ee fan ny N e m 
—1 0 
0 : 
: 0 
0 -— 


as may be seen by performine t iplicati = 

Har P g the multiplication R,P, (i = 1, 2,...,m) and 
| The eigenvectors corresponding to 1,,,,... , X, are found by perform- 
ing elementary operations on A — X (i =m + 1,...,n) until the last row 
of the matrix is identically zero. This is always possible because A — AI is 
singular for X = X,. If the matrix is reduced to the form 


L O 2 O px 
© L e D J 
R-iio oio: 
OO «ss T pa 
00 --- 0 0 
then clearly an eigenvector for 2, is 

ry 

P 

R=] = 

Tn-1 

=1 


The transformation matrix P can thus be determined. 
Next, consider the case where the Jordan canonical form of A is not 


diagonal. This occurs when the eigenvectors for the multiple eigenvalues of 
A are not linearly independent. Suppose that A involves m equal eigenvalues 
À, and other eigenvalues are all distinct, or the eigenvalues of A are 
A Xs, eoog Ay, Ams Naro 3 ere Xn 

Suppose that there is only one linearly independent eigenvector for \,. [Note 
that there are other cases where q(1 <q <m) linearly independent eigenvec- 
tors exist forA;. Analyses of such cases involve no particular difficulty.] Once the 
eigenvector S, is found, the other vectors S,, S,, ..., S, may be easily 
computed from the following relationships: 


(A—XJDS,—S,., (k=2,3,...,m) 


Note that 
(A — A,DS, = 0 
because S, is an eigenvector for \,. From the relationships just given we have 
(A= NDS = S, (3-31) 
and 
(A — 2X,D"S,, = 0 (3-32) 


Applying the same row reduction process as before to (A — AI)” we find 
m linearly independent vectors satisfying Eq. (3-32). These are tested to 
find one that satisfies Eq. (3-31). S, is not known at this point, but any 
vector on the left side that gives a product not identically zero qualifies as 
Sm, and the nonzero product is an eigenvector. 

We are now ready to construct the transformation matrix. From 


AS; — XS,  (i—-1,m--1,m-42,...,n) 
AS, = Spar FASL ~ (k—2,3,...,m) 


we see that a matrix consisting of n column vectors S,, Sa, ..., S, satisfies 
the matrix equation 
AS — SJ 


where J is the Jordan canonical form. Since the columns of S are linearly 
independent, its inverse exists and it follows that 


J = S'AS 


Note that complex conjugate eigenvalues could be handled in exactly 
the same way as real eigenvalues by putting complex conjugate columns in 
the transformation matrix. This, however, wastes computer storage space 
and calculation time because each of the array must be doubled in size to 
hold the imaginary components. Rather than doing this it may be advan- 
tageous to modify the Jordan canonical form. The modification is based on 
the fact that one of a pair of complex conjugate vectors contains all of the 
essential information of the pair. 

Let us consider the case where the Jordan canonical form is diagonal. 
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Other cases can be handled the same way. Notice that the diagonal matrix 


[Ay 0 
J= g 4- jo 
g — jo 
LO As 
can be transformed into the following form 
X; 0 
j= | 7 2| 
! —@ o " 


0 N 


[1 0 


—— MT Ó 


0 E 
LE 
Namely the modified Jordan canonical form Î is given as 
j = K-JK = K^P-'APK 


Not only does J have only real elements but, more significantly, K^' P^' and 
PK have only real elements. It may be seen that the effect of postmultiplying 
P by K is to set one column equal to the real part, and the next to the im- 
aginary part of the complex eigenvector. Both vectors are entirely real. All 
other columns of P are merely copied. K^! has a similar effect upon the 
rows of P-'!. Setting 
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the transformation is given the standard form of 
j—P-AP 
In use we simply enter the real and imaginary parts of the appropriate 


eigenvector as two adjacent columns of P, and proceed to use it as though 
it were a true transformation matrix. 


3-6 SECOND-ORDER SYSTEMS 


Consider the system defined by the following two simultaneous first-order 
differential equations: 

X, AX + 42X2 

Xy = A,X; + AyoXp 


where d,,, d,5, A21, and Az; are constants, or 


x a a x 

|| n all íi (3-33) 

Xa As, Ano} | Xa 
We shall denote two eigenvalues as A, and A,. Since such second-order 
systems occur very frequently in engineering fields, we shall give special 
attention to these systems in this section. 


According to the coefficient matrix we can divide Eq. (3-33) into the 
following three cases: 


Case 1: The system involves two distinct eigenvalues The following 


transformation: 
Age | E 
Xo —05 — d» ys. 


ki = er —ai Me ul E 
Xo ay, —5 V: 
will transform Eq. (3-33) into the diagonal form: 
eia alka 
de 0 AIL 2 
The initial condition terms are transformed into 
pel a 1 ie aui = aj perd 
y:(0) aa — A)L an Az — and L-x2(0) 
Case 2: The system involves complex conjugate eigenvalues The fol- 
lowing transformation 


or 


M _ [20 dX) da S" ^ 
X2 — Qs, 0 J»: 


will transform Eq. (3-33) into the following form: 


fal Me H 


where c and o are defined by 


X, =o jo 

X; =0 — jo 

The initial condition terms are transformed into 
»O| fo -a-i [ao 

Tano 1 = 
y2(0) Gey 304 +A — 2ay1) | | x«(0) 


Case 3: The system involves two equal eigenvalues For this case, 
the following two possibilities exist: 


(A) If a4, = a4 = 0, a = a», then the original equation is already in the diagonal 


form, Or 
| X | | ay 0 | Xi | 
X» 0 yu Xo 


(B) Except in the special case stated in (A), the following transformation 
[3] - [m^ Jid 
Xo As, 0 ye 
Dole iat “lea 
X» A21 OS Ly. 


will transform Eq. (3-33) into the Jordan canonical form: 


mires is 


The initial condition terms are transformed into 
bed z zl 0 1 | E 
¥2(0) anlan Ay — ayy} L xo (0) 


3-7 SYSTEMS DESCRIBED BY SCALAR DIFFERENTIAL 
EQUATIONS (3-/, 3-3)! 


or 


Because of frequent occurrence of nth-order scalar differential equations or 
their equivalent vector matrix differential equations, we shall in this section 


‘Figures in parentheses relate to items in list of references at the end of the chapter. 


discuss simplification of state space representation of nth-order scalar differ- 
ential equations in detail. Consider the following equation: 


P + ex + -e + aniž + a,x = 0 (3-34) 
where the a, are constants. If we define 
Ny =X 
x, =x 
a-n 
X= X 


then. Eq. (3-34) can be written as follows: 


x = Ax (3-35) 
where 
X 
Xs 
x= 
Xn-1 
— Xn _ 
ind 
0 1 0 0 0 
0 0 1 0 0 
sps i, : : : (3-36) 
0 0 0 s 0 1 
—ü, —ü,., —Gn-g *** —4, —4, 


Notice that the eigenvectors of A defined by Eq. (3-36) become distinct 
if and only if the eigenvalues are distinct. Stated differently, eigenvectors of 
A corresponding to mutliple eigenvalues become multiple. This may be 
seen easily if we realize that the eigenvector corresponding to an eigenvalue 
A; is determined from 

(A —A,DP, = 0 
or 
=A Pu t+ Pa =9 


Ai Pa + Ps: = 0 


Ai Pn-11 + Pn = 0 
—4n Pin — Gn-1 P21 — c: — A Pn =O 
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Aside from a constant factor there is only one set of values p,;, P2 - -© +3 Pai 
not all of which are equal to zero, that satisfies this set of m equations. 
This is true whether A, is a distinct eigenvalue or a multiple eigenvalue. 
Hence the eigenvector corresponding to the mutliple eigenvalue of mul- 
tiplicity 7 is also of multiplicity m. The coefficient matrix A of the system 
described by Eq. (3-35) where A is given by Eq. (3-36) can be diagonalized 
if and only if A has distinct eigenvalues. 


Nth-order scalar differential equation with distinct eigenvalues 
Consider Eq. (3-35). We assume that the eigenvalues of A are distinct. The 
transformation matrix P given by Eq. (3-18), rewritten thus 


Ain Agi, irn Any 

Are Asi mene Ani 
P= , : s 

Arin Asin iE Anin 


can be simplified, since elements 4;,; can be obtained in terms of à; as follows 
(Refer to Prob. A-3-6): 


Ain, = = 
Aiia = (—)'an 
Ais = (—) ari (i = 1; 25 wae fl) 


Ain = (— Ya M7 


Hence the transformation matrix P is simplified into 


1 jl Susa ii 
Ai Xs ces 9 
P=| WG Xd ce M (3-37) 
AF Amet one et 


After the transformation x = Py is made the coefficient matrix becomes 
a diagonal matrix and the equation can be written as 


yy E O0 = DATE 
de 0 X = Olly, 


Vn 0 0 eae Xn Yn 
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The initial condition is given by 
y(0) = P^'x(0) 
It is noted that if some of the A; are complex conjugates, for example, 
A, = À and X, = Xz, then the diagonal elements of the coefficient matrix 
involve complex elements. In such a case by another transformation the 
coefficient matrix can be transformed into the following form: 


! 
0, e 1 0 
| 
—@, c! 
-Á Á {iť a 
I 
i 9» Ws 
1 
0 | —@y Gs 


Nth-order scalar differential equation with multiple eigenvalues 

We shall next consider Eq. (3-35) with multiple eigenvalues. The transfor- 

mation matrix which transforms the coefficient matrix into the Jordan 

canonical form can be obtained by the same technique as employed in 
Section 3-4. (Refer to Prob. A-3-7.) 

Suppose that the matrix A in Eq. (3-35) is a 7 x 7 matrix and the eigen- 

values of A are Ai, An Ay, Ay, As, As, Az- Then the transformation matrix is 


given by Eq. (3-30) in which 4,11 A119 - - - , Ariz are given by 
P MESI 
Aus = À; 
Ai = 
Ái M 
Ave = 2 
Aire = Ar 
Aja M 


In other words, the transformation matrix S in this case is 
[1 90 0 0 1 oO 1 
A 1 0 0 As E X% 
x? Dn, 1 0 Xi 25 X 
SX 3M 3A, X M BAS M 
M 4 O2 4A M A OM 
A$ SAt 103 10X) Aj 5M AG 
[AS 6A 15M 20M M GAZ Mj 

Notice that the second column is the first derivative (with respect to X,) of 
the first column, the third column is the second derivative of the first column 


divided by 2!, the fourth column is the third derivative of the first column 
divided by 3!; and the sixth column is the first derivative (with respect to 
às) of the fifth column. 


As an example, consider the following system: 
X 4X 5% +2x=0 


The eigenvalues of the characteristic equation are —1, —1, and —2. The 
transformation 


x = Sy 
where. 

x JA 

memi] 3E y= | Ps 

x Ja 
1 0 1 1 0 1 
S —|X l As| —|—l 1 —2 
Mi DM XM 1 -2 4 


will transform the original equation into 


» “1 1 OT», 
J| = 0 —1 0 ys 
Ds 0 0 —2)Lys 


As an another example, consider the following system: 
i 0 I OpEx 
$;| 210 0 Tl lke 


Xs 1 —3 3jix 


x 


* 


The eigenvalues of the coefficient matrix are X, = A, = A; = 1. Hence, the 
transformation: 


x. = Sy 
where 
1 0 O0 100 
Scl i OLlse|ti X 0 
ME 2M 1 i1 2 1 
will transform the given equation into 
yy 1 1 OTJ 
Jo} =|O 1 1) | ye 
Ds 0 0 IJLys 


3-8 MOTIONS OF FREE DYNAMIC SYSTEMS 


The solution of the following state space equation: 


x = AX x(0) = X, (3-38) 


can be obtained easily through the transformation of A into the Jordan 
canonical form. We shall first consider the case where A has distinct eigen- 
vectors. Since it has been shown that by means of a suitable transformation 


x = Py we can transform Eq. (3-38) into 


y = P^'APy = Dy (3-39) 


we shall treat Eq. (3-39) instead of Eq. (3-38). 
Consider the following system: 


Ji A, 0 0 Jn 
Js 0 X, 0 || y: 
Ýn 0 (0) ace Die Ya 


The solution of this state space equation can be immediately obtained as 


y(t) e^ y,(0) 
y(t) e` ya(0) 


Lyatt)| Ley) 


or 
y(t) = QG)y(0) 
where 
et 0 
e 
Qe) = 
0 et 
Since y = P-'x we obtain 
y(0) = P^'x(0) 


Thus, if the solution x(t) of Eq. (3-38) is desired, it is given by 
x(t) = Py(t) = PQG)P^'x(0) 
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Example 3-5 Obtain the solution of the following state space equa- 


si ri 9 OTT 
l=} 0 —2 8 =| 
E 0 0 —3) Lx, 


The solution is obtained as 


xi) e^! x,(0) 
E = iem 
x(t) e=! xy(0) 


tion: 


or 
x(t) = Q(t) x(0) 
where 
ect 0 0 
Q() -|° ex 0 | 
0 Q. wt 
Example 3-6 Obtain the solution of the following state space equa- 
tion: eo 
[ x, c d- jo 0 0 X 
s 0 c — jo 1M 
FD. 0 lx 
The solution is obtained as 
x(t) e * je x (0) e*' (cos ot + j sin ot) x,(0) 
E = E] = l^ (cos ot — j sin 208) 
x(t) L eM x,(0) e^t xs(0) 
or 
x(7) = Q(r)x(0) 
where 


0 0 e^t 


Clearly, x(t) and x(t) represent an oscillatory motion with its amplitude increas- 
ing or decreasing according too > O ore < 0. 


a — 0 
TE] Q eria 1 


Next we shall consider the case where A has multiple eigenvectors. We 
shall assume that by means of a suitable transformation x — Py the fol- 
lowing state space equation: 


x = Ax 
is already transformed into 
y = PPAP =- Jy 


where J is in the Jordan canonical form. Consider the following system: 
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yy ^, 1 0 » 
y|2|0 X Ll re 
Vs 0 Q AIL 
This is equivalent to the following three equations: 
Ji = My Ys 
Js = M Ya + Vs (3-40) 
Vs = Ms 
Solving the last of Eq. (3-40) we obtain 
Yt) = e"ty«(Q0) (3-41) 


Substituting Ec. (3-41) into the second part of Eq. (3-40) we obtain 
Po = M Ye + e"*y«(0) 
which yields the solution 
yt) = eè y0) + te y,(0) (3-42) 
Substituting Eq. (3-42) into the first part of Eq. (3-40), we obtain 
Ji = May, + ety) + te y0) 
Solving for y, we obtain 
y(t) = ety 0) + te y40) + 3-156 yO) 

Hence the solution y(t) can be written as follows: 

x] [ew te site y,(0) 
yt) |=] 0 et tet y(0) 

jb 


ys(t) 0 eu y3(0) 
or 
y(t) = SC)Q(C)y(0) 
where 
La 
LU d =! 
S(-7|0 1 z 
0 0 1 
and 
et 0 0 
Q= 0 e" O0 
0 0 eM 


Similarly, we can extend the same analysis to more general cases. As an ex- 
ample consider the following system: 


Ji à, 
de 0 
Vs 0 
y a 

Ys 

Js 

Js 0 


1 0i 
h.c b] 
Ü X1 
ir, 1 
LEY 


The solution y(t) can be written as follows: 


or 


where 


and 


wo] [edt rent Lren | 


Y(t) 
y3(t) 
y(t) 
ys(t) 
Yelt) 
y(t) 


0 e 
0 0 
E 
S(t) = 
e 
Q(r) — 
0 


I 
| 
I 
[i 


tert 
et | 
NK PE 
[0 et | 
Mean adiecta ER T 
| eu 
t. 
y(t) = S(t)Q(r)y(0) 
1 [j 
E pm 
E 
0 I1 R- 
00 1 | 
---------=--- i-----4 
Uy p 
lo 1j 
1] 
0 10 
et 
et 
eu 
eut 
ent 


07] 


In terms of the original state vector x(t), the solution is given by 
x(t) = Py(t) = PS(r)Q()P^x(0) (3-43) 


Notice that Eq. (3-43) is the general solution of Eq. (3-38) if we interpret 
S(t) = I for the case where A has distinct eigenvectors. 


A few comments on motions of free dynamic systems (3-4) Con- 
sider a system described by x = Ax where A is an n X n constant matrix. 
Assume that the eigenvalues of A are X,,A,,---, Ay, Amst. -- -> Àn- In such 
a system any free motion can be represented uniquely as a linear combi- 
nation of motions e (i= m 4-1, m--2, ..., n) and tře" (h =Q, 1, 
2,...,m — 1). If à is real, the motion e* is along the eigenvector x;, the 
eigenvector corresponding to the eigenvalue A;. If A; > 0, then the motion 
e* along the corresponding eigenvector approaches infinity, and if A, < 0, 
then the motion eè“ approaches the origin of the state space. If, for example, 
X44; ANd Amy. are complex conjugates, however; then the motion e*=- and 
motion e*=* cannot be separated from each other. The combined motion 
takes place in the plane determined by eigenvectors X,,, and Xn»... The 
motion in this plane is oscillatory. (Whether the oscillation is damped or 
growing depends on the sign of the real part of X,,,,.) 

The amount of each motion e*' (j = 1, m + 1, m + 2, ..., n) depends 
only on the initial state. (Each motion is excited by the initial condition. For 
a system with only real X's, if the initial condition is on the eigenvector 
Xi(i = m+1,m+2,...,n), then only the motion e* occurs. If the initial 
condition is on the multiple eigenvector xı, only motions eit, fe‘, ..., and 
17-1eht will occur. Notice that any initial condition that will excite the motion 
et will also excite the motions te^, tet, ..., and pe", 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-3-1 Obtain a transformation matrix P such that P-! AP 
is diagonal. A is given by 


6 1 8 
A-| s @ 2 
s uw .% 


Solution The roots of the characteristic equation |A — AI | = 0 are 


found from 


= 2 |=-Aa+NDA+2a43)=0 
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as A; = —1, A, = —2,and A; = —3. A diagonalizing transformation matrix P is 
given by 
| —M 2 =À: 2 — s 2 
—7 —6—2, —7 -6—-r, —7 —6—X 
lion - 3 2 | - 3 2 E 3 2 
—12 —6— —12 .—6 —24 —12 —6—2A 
| | 3 hs | 3 — 3 —» 
—12 —7 —12 —7 —12 —7 
| 1 s 2 2 3 2 
-7 — —7 —4 —7 — | 
ES | 3 2|] — 3 2 3 2 4 b " 
3 1 | i 2 3 3 PT mw "NE 
—12 —7 —12 -—7 —12 — | 


We may divide each column of P by a constant. If we divide the first column by 9, 
the second column by 3, and the third column by 5, a diagonalizing transformation 
matrix occurs as x 


1 2 1 
P,=|—1 —4 —3 
—1 1 3 
The inverse of P, is obtained as 
PoPog 
P =| -3 =2 =i 
i 3 1 


As à check let us compute Py'AP,. 


| Ow" D o3 1 
PDAP,—-|—3 —2 -I 3 8 ai - o 
3. 3 ul- —7 -6}/L-1 1 3 
-1 0 0 
=| 0.2 n0 
0 0 —3 


m Problem A-3-2 Obtain a transformation matrix P such that P-! AP 
is diagonal. A is given by e 


A= 


Assume that A has distinct eigenvalues. 
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Solution Let us denote four distinct eigenvalues as Aj, Xs, A4. and Ay, 


or 
[A " Ai] =A — X). — MA — 24) — Ay) 


The elements in the ith column of P are given by 


E deg — Xi a23 DENI 7 
a32 4i — M a34 
ayo Os Gu — M 
KEN a23 Go, | 
- 3i az; — M s, 
ayy 55 ag, — M 
P; = 
Qoi an — M a24 
| 3 32 a34 
au Gs Qu — Ni 
[m o — M a23 
- 3i azz a33 — : 
ayy Ayo Qs iJ 
(i = 1,2,3,4) 


Problem A-3-3 Prove that any n x n real symmetric matrix A is 
orthogonally similar to a diagonal matrix. 


Solution Let us denote the eigenvalues of A as Aj, A5, ..., A4. The 
A; are real. There exists a normalized real eigenvector x, satisfying 


AX, = Ay X; 


Let T, be an orthogonal matrix with x, as its first column. Then the first column of 
T; AT, is obtained as follows (refer to Theorem 3-4): 


^ 
0 
0 
Hence noting that T, is orthogonal and A is real symmetric, we may write T; 'AT, 
as follows: 
Ti AT; = |---i--- 
0 | A, 
where A, = (n — 1) x (1 — 1) real symmetric matrix whose 


eigenvlaues are As, 45, ..., An 
0 = matrix consisting of zeros 
Similarly, we can find an (n — 1) x (n — 1) orthogonal matrix S such that 


where Ay = (n — 2) x (n — 2)real symmetric matrix whose 


eigenvalues are 34, 4, ..., An 
By letting 
] 54 
f, elL-——L— 
0s 
we obtain 
1i o] o] 50 
Tz'(Ti 'AT)T, = s ao [oeil Mm ---- 
0:S"*jJL0 ;AJLO!S 
a A a me” 
C0 oH | se ae PS Ae: LU 
E uui  MemIcm 
0 ' S-'A,S S lx 
1 2 
Repeating the same process we can finally obtain 
Ai 0 
Ag 
TAT s T'AT = 
0 Ng 
where T =T,T, ---T, 


Problem A-3-4 Suppose that an n X n normal matrix A possesses 
real eigenvalues. Prove that A is Hermitian. 


Solution Since A is normal, there exists a unitary matrix U such 
that 


U" AU = D = diag(44,25,..., An) 
where Ay, As, ..., Àn are real. Then 
A = UDU-! = UDU* = UD*U* = A* 
Hence A is Hermitian. 


Problem A-3-5 Find a transformation matrix S such that 


S7AS = J 
where 
4 1 —2 3 1 0 
A=|1 0 2|, J=|0 3 0 
1 —1 3 0. 0 1 


Solution The characteristic equation becomes 
JA — AI] = —A3 + 7A? — 15x +9 = —(A — 3)(A — 3)(& — 1) 


Hence the eigenvalues of A are 3, 3, 1. Simple computation reveals that there exist 
two linearly independent eigenvectors. Therefore, it is possible to transform A to 
J by a suitable matrix S. Such a matrix can be obtained from Eq. (3-26). We shall 
denote this as S,. 


—h 2 —A; 2 | 
in AM Bs 78 | 1 3—X 
1 2 1 NE 2 | 
mie -| 1 Ex] i. 5-5» 
1 —r, 1 1 —Xs | 
| 1 —1 1 —1 
By substituting A, = 3, X, = 1 into S,, we obtain 
2 3-07] 
S,2|2 1 0 
2 1-0 


Notice that the elements of the third column are zeros. Therefore, we must use 
other than As 3s; Asis for the third column. Let us take Assis Aso, Ass; Where 
the Az»; are the cofactors of the second row of the determinant 


a—» b c 
d e— J 
g h îi — Ms 
Namely, 
b c a — s; c A de 7 b 
Ası = — P , Asn = TR 333 = g h 
h i— g i " 
Then S becomes as follows: 
—A 2 2x 3 1 —2 
—1 3-r : =f 3=% 
1 2 1 4-—» —2 
S= -| 1 ap id E NE 
1 —A 1 _|4—-As 1 | 
| 1 -1 1 —1 |_| 
2 3 0 
=|2 1 8 
21 4 


ituti i i AAS = J. 
By substitution, we can easily verify that S 
Notice that S is not unique. If we postmultiply S by V, where, for example, 


1 


1 
a 
ud B uh 6 
zh 2 
1 
6 9; x 
then 
120 
SV=|1 12 
1 1 1j 
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will satisfy (SV)-'A(SV) = J. Hence examples of transformation matrices that 


transform A to J are 
2. 3 0 I. 2 07] 
l; D oW P zu etc. 
[2 1 4 bi db 3 
Problem A-3-6 Given the matrix 


0 1 0 xus "0 0 
0 0 1 0 0 
wm D 5 
g- ff we we 3 
—Gi -—u-; Gane c0" ae  —4j 
show that the cofactors A;,; (i, 7 = 1, 2, . . . , n) of the determinant formed as 
|A — rll 
are 
Ain = (qe 
Ain = (—)"a, 
Aus = (—)" andi (3-44) 


Ain = (—) anri? 
( ER a) 


Assuming the eigenvalues of A are distinct, show that the transformation matrix 
defined by (3-21) that transforms A into a diagonal matrix can be given by 


1 1 1 
Mr X Xs 
p- M M p 
ADU XU AR 
Solution 
—r, 1 0 0 0 
0 —r; 1 0 
lA-XEH-| i o: ; ; ; 
0 0 B. sx am i 
Saa eer üp E Sy A N 


The sum of the products of the elements of any row and the cofactors of correspond- 


ing elements of a different row is zero. Thus, for the elements of the second row 
and cofactors of the first row: 


OA — MAn + Ais + OAD + +++ + OAnn = 0 
Thus, 
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Aus = Xi Aiia 
For the elements of the jth row (j = 2, 3,..., n — 1) and cofactors of the first row: 
OA + c OA4ao-o — Aa; t Áagsp + OAngsn te + 0 Ann —0 


Thus, | 
Angin = Aas (3-45) 


For j = n — 1 we obtain 
Ain = Ni Airin- 


An can be computed as follows: 


0 —M » 410 0 0 

0 0 —r; 1 0 0 

Ain = ey b : . . . 
0 " i s py 0 —M 
—A, —An-1 a + 0 =a;  —üs 


Expanding by minors about the first column 
e. Tq Uee OÀ 
Ais = (y (iei (a) 
0 Gi Q - Q^ Ay 


Again expanding by minors about the first column 


— Ài 1 0 0 0 
7 0 A, 1 0 0 
Aan = (—)7**a,(—X) : : 
0 0 0 ss 0 =Ni 


It can be easily seen that by continuing the expansion about the first column the 
result is E 


2 


Aus = (—*a( X)( X47 = (aM 
This proves the last of Eq. (3-44). Then using Eq. (3-45), rearranged 


1 
Aij = x ducem 
we obtain 
1 1 1 1 
An = x 402 = y dm = xpi = (—)" an 5 


Aus = (—)" an 
Aus = (—)" ani 


The transformation matrix P that transforms A into the diagonal form can be 
obtained as follows: By dividing each column of the following matrix (4), the 
matrix defined by Eq. (3-18), 


" 1 —) pa n —n Pu 
(—) xat ( ) x ( ) ai 


(—»*a, (—»Fa, ses (—)"a@, 


(—)"a, 7? (—)"a, rz? see (—Ya,A27* 


by (—)? a, (1/X,) we obtain P in the following form: 


1 1 1 
Ay Aa Àn 
p=| M M MÀ 


XR! XE e. me 


Problem A-3-7 Consider the system 


X, 0 1 0 Kiar 0 0 x, 

Xo 0 0 1 sac 0 0 X 

& pep 8 : : AS : (3-46) 
Xa 0 0 0 ex d 1 isa 

x Gn —Gn-) —Gn-» +++  —d$4 —Q Xn 


Suppose that the characteristic equation involves a pair of double roots. Derive 
a transformation matrix S such that after the coordinates are transformed the 
equation is in the Jordan canonical form. 

Next obtain a transformation matrix that will transform the system equation 
into the Jordan canonical form when the characteristic equation involves multiple 
roots of multiplicity four. 


Solution We may first assume that the double roots are slightly 
different from each other and write the roots of the characteristic equation as M, 
A, + AA, As, ..., Àn and obtain a transformation matrix S and then let AX, — 0. 
If we denote the transformation matrix as obtained in Prob. A-3-6 as P, then 


1 1 1 stes 1 
^ hi > ANA. Ns we 
P=| . " " E (3-47) 


Lar (Ay AX ARTE e ARTI 


After the transformation x = Py is made, the system equations become 


Ji = Mi 
Jo = Qa + AX)ys 
Js = ys 


Yn = XnYn 


If we perform the second transformation given by y = Qz, where 


0 0 0 
then we obtain the following equations: 
2, = (A, + AAD cz 


Ža = Ay Zo 
: _48) 
Žž = ÀZ; (3-48) 
Zn = NnZn 


Notice that although, P becomes singular as AX; —> 0, PQ is nonsingular as 
AX, —> 0. Letting AA, approach zero in Eq. (3-48) we obtain 


A —MZ + 22 

Ža = À Z2 
Z,-X,2; (3-49) 
Zn = AnZn 


Equations (3-49) yield the Jordan canonical form when the characteristic equation 
of Eq. (3-46) involves double eigenvalues A, = As. The transformation matrix S 
that transforms Eq. (3-46) into Eq. (3-49) can be obtained as follows: 


S — lim PQ 
d 1 0 $o a, A 
Ai + AX, 1 às iiini Xn 
Qa + AA) 23, + AM ms eg M. 
= lim | (A, + AX? 3A + 30, AA, + AN M - » | 
TIE (A, AXE AXE MAN, + AQAMCRAM M oce M. | 
: ; bi ane EAL 
(A, + AN)! : A3 An 
1 0 ] o 
X 1 Ks wae Au 
M 2X X  àe N "T 
Tw A 3X1 o evn IA 


XP! (n— DAP MU ee X 


Notice that the elements in the second column are derivatives (with respect to ^i) 
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of the corresponding elements in the first column. After the coordinates are trans- 
formed, the equation in z coordinates becomes as follows: 


A] Pa 1 0 w+ ojja 


Zs 0 X, 0 «+ Ø pA 
à4á|2|0 0 X» + O]] 2s (3-51) 
Ži 0 0 Q0 z ^ Zn 


If the characteristic equation involves multiple roots of multiplicity four, then the 
following transformation matrix: 


l 0 0 0 1 ium 1 
M 1 0 0 Xs 20 Àa 
M 2X; 1 0 M o M 
S=| 3 3A} 3X, 1 ab 6 XE 
n-1 d hn- ] a? » 1 d? - 7 - 
E i qo 1) ara 1) tamer) az E aw A2 1 
(3-52) 
will transform Eq. (3-46) into the following form: 
A A; 1 0 0 i 0 z 
2p O0 » i 0 | " 
23 0 0X 1! " 
^ |= 9 0 0 ^ Z; 
" E" "MI 
^ j D Kac Ltn 


Notice that in Eq. (3-52) the second column is the first derivative (with respect to 
14) of the first column, the third column is the second derivative of the first column 
divided by 2!, and the fourth column is the third derivative of the first column 
divided by 3!. 
Problem A-3-8 Consider the differential equation 
ž+až+aš+ax=0 

or its state space representation 

X [ 0 1 0 x, 

Xo = | 0 0 1 Xo 

Xs —a; —a, —a,| Lx; 
Assume that the characteristic equation includes two equal eigenvalues, or the 
eigenvalues are X,, Ay, As. Show directly that the following transformation 
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x = Py 
where 
0 1 1 
Peli X Ns 
224, M M 


will reduce the original equation into the following form: 
yr Mm 0 0 yi 
yo|=|1 M O||»: 
Js 0 0 X Js 
Solution We shall denote the coefficient matrix of the given state 


space equation as A. 


0 1 0 
Ac 0 1 
—a; —@, —4ü 


By the transformation x = Py, we obtain 
Py = APy 
Forming the product AP, 
0 i 0 0 4 29 
AP-| 0 0 1 1 X X 
—a, —a, —ajL2M Ab OM 


1 ee rs 
= 2r1 nN 25 
—a,—2a,M -—ay— aM —4a — a, — a — eM. 


Since the eigenvalues are Ay, ^, Az, the characteristic equation becomes 


(X ADA — ADNA — Ag) = AB (72.4. — NN + (AZ + 23424)& + (M4) = 0 


Hence we obtain 


a, = —2d, — X 
a, = M + 2.455 
a; = —M^ 


The matrix AP can then be written as 


E At As 
Apue|2wu RP OM 
31 x X 


Since P-! is obtained as 
[aids —Ah, MM-M MM 
A — 2M As 2X, —i 


Piwsr—a 5 
= EN 
M cS || M s 1 


we obtain 
1,0 0 
P-AP—|1 à 0 
0r 300 X 
Notice that 
9 1 OIfAa, O oTe 1j 9 `à 1 0 
10 Oji à 0]|[1 0 O/=|6 X, 0 
9; 0 ILO O X10 ð 1 0 O X 


Problem A-3-9 Simplify the following state space equation: 


x, 0 1 0 0 x; 
ž| | 0 0 1 0 x. 
X3 SX 0 0 1 xs 
X4 —44 —4, —a —a, X, 


where the characteristic equation 
AA + aX? + ar? + aX +a,=0 
is assumed to possess two identical roots and one pair of complex conjugate roots 
that is, A, = As, A; =o +jo, à =0 — jo. 
Solution The following transformation 
X = Py 

where 

1 0 1 1 


A, 1 c 4 jo a — jo 
M 2X, (ec joy (e —joy 
M 3M (e-cjo) (e—joy 
will transform the given state space equation into the following form: 


Vy A; 1 0 0 Jy 
Ja = 0 A 0 0 Y» 
Bi [L8 U aja ü Ys 
Vs 0 0 0 gs — jo Ja 
Let us carry out the second transformation 
y=Q 
where 
1 0 
0 1 
Q-|o 


then the transformed equation becomes 


A & 10 B $ 
4| |0» 0 OF] z 
4| |9 $ o 
Z 0 0 -w c 


Problem A-3-10 Given the following state space equation: 


X — Ax 
where 
X —a, 1000 
Xe —a, 0 100 
X =| ol, A=|-a, 0 0 1 0 
X4 —a, 0 0 0 1 
Xs —a 0000 


obtain the transformation matrix P defined by Eq. (3-18) that will transform the 
coefficient matrix into the diagonal form. Assume the eigenvalues of A are distinct. 

Solution The (i,j)th element of P is A;,;. In order to determine 
Aij let us form the characteristic equation and substitute X = A, in the equation. 


—a,—AX 1 0 0 0 
— ds —M 1 0 0 
|A—AJI| = —a; 0 -X 1 0 |20 
—4, 0 0 =À; 1 
—d; 0 0 0 —X 


Since the sum of the products of the elements of any row and the cofactors of cor- 
responding elements of a different row is zero, we obtain for the elements of the 


last row and cofactors of the first row: 
—@5Aiy, — AAs = 0 
Thus, 


d; 
Aus = — X, Ain 
t 


For the elements of the fourth row and cofactors of the first row: 
—a,ÁAu — Ang + Ans =0 
Thus, 


Similarly, we obtain 


Aj, is obtained as 
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Hence we obtain 
Ay; mM 
Aus = —(aM + aMi + ar; + à) 
Aus = —(a3M + ad} + ashi) 
Au, = —(aM + 45M) 


Ans = —a;r} 
Then P can be given by 
ig x4 zi ne 
—(a9Mi + asdf + ar, + a) —(ayr3 + aA + a,r5 + a) 
P= —(ash} + ari + azı) —(a34$ + aM + aNs) 
—(a4M + aM) hS —(a,r3 + aA) 
—a;r} e x —a;r3 


Notice that since 
Mi + aM + ard + aM + ary + a; =0 
we have 
(adj + a3M + ar; + a5) = A$ + art’ 
—(ajM + aM + ary) = AS + 41M + ani 
—(aAM + aM) = M + aM + aM + aM 
adi = X + aM + aM + ay + art 


Then, by dividing each column of P by Ai we obtain a transformation matrix P 
as follows: 


ia 1 eee 1 
| A +a eee As +a 
P, = AG + ary + as ee AS + GAs + a; 


| E+ aM + ar, +a; AS + ajM + as + a; 
LM-raM-c ag? + ashy +a, e att 4,5 + ad} + ad; + a, 
Problem A-3-1] The transformation matrix P defined by 
1 1 1 
P- Ay + a, Ae + a, As +a, 
M+ ar, +a. M Had ta Aba añ; + a 


will transform the coefficient matrix of the following state space equation into the 
diagonal form: 


x = Ax 
where 
Xi =a, 1 0 
X=| xl, A=|-a 0 1 
X3 —a, 0 0 


|A — M| = —Q? + aM + aX +a) = —(.— X). — 34). — As) 


orovided that à. 1. and 3. are dictinet Shaw that if T —3 3 the fAllevie 
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matrix S will transform the coefficient matrix into the Jordan canonical form: 
1 0 1 
S= ` +a 1 As +a 
Mitad ta 2D +a Btar; +a 
Show also that if A, = A. = A,, S must be modified into T where 


1 0 0 
T= A, +a, 1 0 (3-53) 
Ai + ar, +a M +a 1 
Solution 
Case 1: where X, = Xs Æ à; The transformation 
x-—Piy 
where 
1 1 1 
P, = Ay +a, A, + A +a A3 +a, 


M 4M +a. Qu + AX Hal AX) + ag AFH ad + d; 


will transform the original state space equation into the following form: 


yy ^ 0 0 ^ 
Je|2|0 A,+AA, O||lys 
Vs 0 0 Xs Js 
If we perform the second transformation 
y=Q,z 
where 
0 -5x 0 
Q= RS 0 
0 0 i| 
then we obtain 
EA A SAM 1 OTFR 
2, |= 0 A, 0 Zo 
Z 0 0 »lLz 


Noting that P, Q, is nonsingular as AX, —> 0 we see that by the transformation 
x = Sz = lim P,Q,z 
3,0 
we can transform the coefficient matrix of the original state space equation into 
the Jordan canonical form. The transformation matrix S is given by 
1 0 1 
S= im P,Q, = A, +a, 1 As -+a 
AY,-0 
' aad +a. M Ha M Ha H as 


Case 2: where X, — A, =X; The transformation 
X = Pyy 
where 
1 1 1 
P= " A, +a, Xi +AA +a, X T2AX, +a, 
l Ai +a +a, (X, TAX Ta, HAA) +a A, T2AX)' -+a (A, --2AX;)--a, 
will transform the original state space equation into the following form: 


J 


Vi ry 0 0 Ji 
J:|[-|10 A+ AX, 0 X2 
Js 0 0 Xi + 2AX, Ys 
If we perform the second transformation 
y = Qz 
where 
0 Pee 
9 ^ SAX 
Us] wa. — 1 
AX, AX 
p E 
AX, 2A 


then we obtain 
^i + 2AA, 1 0 Zi 
"E 0 A, +AA 1 Zi 
Zs 0 0 AL} Lz 
Noting that P, Q, is nonsingular as AX, —— 0 we see that by the transformation 


X = Tz = lim P,Q, 
sos Quz 


Ne 


he 
| 


we can transform the coefficient matrix of the origi 
| original state space e ion i 
the Jordan canonical form. T is given by j haniai 
1 0 0 
T= lim P.Q:=| aA, +a, 1 0 
Mi + ar, + as 2A, +a, 1 


- . Problem A-3-12 Obtain a transformation matrix T which transforms 
the following state space equation into the Jordan canonical form: 


X 6 1 O][»x 
X,|—|—12 0 1 Xo 
xs 8 0 ojl 


Solution The characteristic equation becomes 
M — 6A? + 124 — 8 =(A. —2)8 =. — Ay = 0 
Hence, from Eq. (3-53) T is given by 


r 1 0 0 


T= Ay +a 1 0 
LAI TAA, +a 2,+a, 1 

f 1 0 0 

=| —4 1 0 

4 —2 1 


PROBLEMS 


Problem B-3-1 Assume that A is an n X n matrix and the eigen- 

values of A involve k pairs of complex ones, namely 
assu As Nan ce 23 005 Das Uses eret: x ia Na 
where the first 2k eigenvalues are the k conjugate complex pairs and the last  — 2k 
eigenvalues are real. Assume also that they are all distinct. 
Show that it is possible to transform A into a diagonal matrix, or 
P;'AP, = diag Qa, Ay, . ++, Nes As en «+ + Àn) 

The columns corresponding to A, and A, of the transformation matrix P, consist 
of conjugate elements. Show also that by a second transformation P, it is possible 
to transform A, = Py'AP, into the following: 


à, 9 1 
G: 
Pz'A,P, =A; =| - G: 
Nokti 
0 Naj 
where 
G; = | = M 
TO; Oi 
Xi =; T joi 


This means that by the transformation P = P,P,, A is transformed into A», or 
P-!'AP = A, 
Show that the transformation matrix P is a real matrix. 
Problem B-3-2 Diagonalize the coefficient matrix of the following 


state space equation: 


Xi 0 1 0 Xi 
X,|2|4 2 1 X» 
s 3 3 34 Los 
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Problem B-3-3 Prove that if a 3 x 3real symmetric matrix A where 
4 A Aiz 
A =| az an sg (a; = ay) 
Azı As2 Q3; 


possesses eigenvalues X;, X; Ag, then the elements a;; will satisfy the following 
relationships: 


Qu —?i1.. — dg — , dg 
aiz ds» — Mi Q3 
Qi —h _ Mp ds 
13 Q, 33 — Ay 
Problem B-3-4 Find a matrix S such that 
SA = JS 
where 

2 1 3 zr, 
A= 4 I 2}, J=/0 2 0 
-34 H 2 00 1 


Find also a matrix T such that 


TA = JT 
where 
2 0 0 
J=/1 2 0 
L0 0 1 
n Problem B-3-5 Obtain a state space equation of a system defined 
y 


X + 9X + 23% + 15x —-0 
Simplify the coefficient matrix of the state space equation into the diagonal form. 


Problem B-3-6 Consider the following equation: 


(n) (n-1) 
X +a, x Tee +4,-.% + a,x =0 


By defining 
Je 
ye X + a,x 
Ys X + 4X + a,x 


(n-1) (n-2) 
Yac X FAX dceeedcbaax 


the given differential equation becomes 


y = Ay 
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where 
»i —a, 1 0 0 
Y2 — ds 0 1 0 
ys EE E : 
Yn-i —à,-, 0 0 1 
Yn =â; 0 0 0 
Define 
1 0 0 0 
a, 1 0 0 
B=| a a, if 0 
Qn-1 Gn-2 An-3 1 
Show that 
0 1 0 0 
0 0 1 0 
B-'AB = : 
0 0 0 1 
—, —Qn-1  —g-» —aQ 
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State Space Representation 


of Control Systems 


4-1 INTRODUCTION 


In state space analysis of systems we frequently wish to obtain a state space 
representation of the following scalar differential equation: 


(1) (n-1) (n) (n=1) 
X+a,x+--- + 44.X + a,x = Bou + bu +--+. + bn- Ú + bau 


where the coefficients a, and b; are constants, or conversely, we wish to trans- 
form state space equations of the form x = Ax + Bu into scalar differential 
equations. 

In this chapter we shall be mainly concerned with deriving state space 
equations from scalar differential equations and vice versa. Specifically, in 
Section 4-2 we shall derive state space representation of systems described by 
nth-order scalar differential equations. Section 4-3 discusses a converse 
problem of obtaining scalar differential equations from state space equations. 
In Section 4-4 we shall present the partial-fraction expansion technique for 
obtaining state space equations of transfer functions. Finally, Section 4-5 
deals with difference equations and their state space representation. 
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4-2 STATE SPACE REPRESENTATION OF SYSTEMS 


DESCRIBED BY SCALAR DIFFERENTIAL 
EQUATIONS 


In this section we shall derive state space representation of systems described 
by scalar differential equations. We shall limit our discussions only to linear 
Systems. Notice that if a state space equation and a scalar differential equa- 
tion are to represent the same physical system, then these two equations 
must have the same characteristic equation. It is clear that if the scalar 
differential equation is of the order n, then the corresponding state space 
equation must consist of n first-order differential equations. 

In deriving state space equations from scalar differential equations, we 
should neither divide the common terms in the left-hand side and right-hand 
side of the equation, nor cancel the common terms in the numerator and 
denominator of the transfer function. For example, consider the following 
equation which describes a forced dynamic system: 


X+3X+2x¥=u+u 
where x and u are the output and the input of the system, respectively. This 
differential equation corresponds to the transfer function 
X(s) . s+1 
U(s) s(s + 1s + 2) 
If the common terms s + 1 in the numerator and in the denominator are 
canceled, then the transfer function is reduced to 
X(s) | 1 
U(s)  s(s4-2) 
which corresponds to 
¥+2%¥ =u 
The original differential equation has the characteristic equation of 
M+ 347+ 27=0 
whereas the simplified differential equation has the characteristic equation of 
V+ 2Xx=0 


Hence the original third-order differential equation and the simplified second- 
order differential equation do not represent the same system. 

From the foregoing analysis, it is clear that in deriving a state space 
equation of the system we should not divide the left-hand side and right- 
hand side of the system’s scalar differential equation by the common factor. 
Keeping this in mind, we shall.derive state space representation of the 
following scalar differential equation: 


(n) (n-1) (n) (n-1) 
X-Fax4 e + aniž + ax = bou + but -:---+6,,4+5,u (41) 
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where the a, and the 5; are constants. Note that there is no loss in generality 
to assume the coefficient of the highest-order derivative of x to be unity. 
Any of the coefficients a; and b; may be zero. We shall first consider a simple 
case where b,— b, = <.. — b,.,— 0 and b,— bz&0. Then we shall 
treat a general case of Eq. (4-1). 


Nth-order scalar differential equation with simple forcing term 
Consider the following equation: 


(n) (n=1) 


x+ ax + +--+ + a 4x + a,x = bu (4-2) 
By letting 
Xy 
Ng SX; 
(n-1) 
Xa X 


Eq. (4-2) can be written as 


X = Ax + Bu (4-3) 
where 
F X 0 j 0 0 
E 0 9 0 0 
x= | 5 A= A B= 

| . . B . . 
pa 0 0 tee 1 0 

acu i] —. Apg H Qj 1 


Transformation of A into the Jordan canonical form will ease com- 
putational problems and will provide the insight important to system com- 
pensation. In the following discussion it is assumed that the eigenvalues of 
the characteristic equation are real and distinct. (Extension to the case where 
the given equation involves complex conjugate eigenvalues is obvious.) The 
case of multiple eigenvalues will be treated later. 

If x is transformed into y by the transformation matrix P, or x = Py, 
then the state space equation becomes 


x = Py = APy + Bu 


y = P-'APy + P! Bu 
= Dy + P-'Bu (4-4) 
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where D is a diagonal matrix. The forcing term Bu is thus transformed into 
P"! Bu. Since P~! Bu can be obtained as 


LY [oS 
0| impr" 
Na 
0 ney 
P Bu= P= . |= p 
u Ann y 
d LH. 


where A,; is the cofactor of the (7, j)th element of P, Eq. (4-4) can be 
written in the following form: 


š A, 
Ji = Mr + [P| u 
Fa = aa + GH (4-5) 


ae pe 
Vn An i iP| " 


Notice that in the presence of the forcing function the resulting transformed 
form is not unique in the sense that the coefficients of the forcing function 
terms depend on the choice of the transformation matrix P. This implies that 
if we choose P properly, each coefficient of the forcing function u can be 
made unity. We shall see this in the following. If we write 


v D 0 Q s Q i 
v, 0 w 0 0 l 
PBs = i = VF 
| v, 0 0 0 ved let 
where 
u D p 0 
0 wu 9 0 
WV zt LE © | = nonsingular matrix (4-6) 
0 0 O Un 


1 
1 
E] * 
1 
then Eq. (4-4) can be modified into 
y = Dy + VFu 
If a new variable z is defined by y — Vz, then 
z = V^ DVz + Fu (4—7) 
Since V is a diagonal matrix we have 
V'DV=D 
Equation (4-7) can therefore be simplified into 
z = Dz + Fu 
namely, 
F ^ 0 0 21 1 
2» . 0 As 0 29 1 
a Pree be ee) a 
Zn 0 0 Xn <n 1 
or 
Z -—Xz--u 
Zo = MZ + u 
; (4-9) 
Zn == XnZn T u 


Thus, it has been shown that by the transformations 

x = Py, y= Vz 
Eq. (4-2) can be transformed into the form given by Eq. (4-8) or Eq. (4-9). 
The initial condition terms are transformed into 

z(0) = V-'P-'x(0) 


Notice that z(r) is uniquely determined by x(0) and u(r). 
We shall next consider the case where A involves multiple eigenvalues. 


In this case the form given by Eq. (4-8) must be modified. Consider the 
following system: 


X—3X,£--3Mx—X-u (4-10) 


The three roots of the characteristic equation are equal, or A, = X, = A;. In 
Chapter 3 we have shown that the simplest form we can get of the homo- 
geneous equation of Eq. (4-10) with A, = X, = A, is 


X; AS" Ton 0 xi 
Ape qox, | X 
Xs O © A, Ixy 
For the nonhomogeneous equation we may write 
x, A, 1 O Tix [a 
Sel S10? Ay 3 X| + | 5 | [ul 
X, L0 0 Alix e) 


where a, b, and c are constants to be determined. By eliminating x, and x, 
from this last equation and writing x, — x we obtain 


X — 3A,.X + 30iX — Mix 
= ait + (b — 2axj)u + (Aja — X, b + cju 
The right-hand side of this equation must be equal to u. Therefore, 
aü + (b — 2ad, -- (Aja — àb + cu = u 
which yields 
a=b=0, c= 1 
Hence the simplest form of Eq. (4-10) can be written as 


[X A, 1 O X3 0 
Xib-—-]0. X 1 x| + | 0 [ud 
X, 0 O0 "ILE; 1 


Similarly, the simplest form of Eq. (4-3) when A involves, for example, 
seven eigenvalues (X;, X, Ay, X4, X4, Xo, Ar) is illustrated below: 


Vy A, 1 0 : olj y» 0 
V: 0 à 1 Js 0 
29 pe 4 d NO s 
Jı = | A. 1l da: [3E 0 [u] 
Js 10 X! ERES Js 1 
Js | A 0 Je 1 
»J Lo TESI 1 


(For the case of multiple roots, refer also to Section 4-4.) 
Example 4-1 Consider the system described by the following scalar 
differential equation: 
X --6x--1lx--6x—u (4-11) 
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Obtain the state space representation of the system equation in which the coefficient 
matrix of the state vector is in a diagonal form and the elements of the coefficient 
matrix of the forcing function u are unity. 


Let x; = x, x, = X, and x; = X. Then 
x = Ax + Bu 
where 
x1] r 0 1 0 0 
ZH à=] 0 0 1.|5 B=| 0 
| Xs —6 —li —6 1 


The eigenvalues of A are —1, —2, and —3. Let y;, Yz, y», be the principal coordi- 
nates. By means of the following transformation 


x = Py 
where 
eee | 1 1 1 1 
P=| A. A,|2|-—-1 —2 —3 (4-12) 
A] X1 OM 1 4 9 
we obtain ; 
5»! [-l 0 Oll» 0 
H = 0 —2 O|l| y; | - P71| O | [iu] 
LJ; L 0 0 —3) L»: 1 
or 


y = Dy + P-!Bu 
Once the roots of the characteristic equation are known it is not necessary to evalu- 
ate P-'AP, since P-'AP is a diagonal matrix with only eigenvalues appearing on the 
diagonal. It is necessary, however, to find P-! since the forcing function term is 
modified by P-!. P-! is obtained as 


3 fod 
P- =| —3 —4 —1 
1 t — i 
and 


i 
P-!Bu =| —1 | [u] 
E 
Equation (4-11) is thus transformed into 
Ji = —» + iu 
Jo = —2y»—u (4-13) 
y, = —3y, + iu 
To transform Eq..(4-13) into the form of Eq. (4—9), let 
y = Vz 


STATE SPACE REPRESENTATION OF CONTROL SYSTEMS 181 


or 
yı i 0 0]1rz 
y |-|O0 -1 0llzi 
Js 0 0 2 Z3 
Since 
2 0 0 
Va =|0 —1 0 
0 0 2 
and 
2 0 0 i 1 
V-1P-iBu —|0 —1 0||—1| [ug] —| 1 | [u] 
0 02 i 1 
Eq. (4-13) can be transformed into the following form: 
Z;-— —Z +u 
Z. = —2z, + u (4-14) 
Zz; = —3z + u 
or 
Z —1 0 O71 fz 1 
i. |= 0 —2 0|] z: |+} 1 | [a 
à 0 € —23liz 1 


The initial condition z(0) is given by 
z(0) = V-'P-'!x(0) 
For a given u the solution to Eq. (4-14) can be obtained easily. To obtain the 


solution in terms of x, we must transform z into x. In the present problem this can 
be done by the following transformation: 


Xi i +f rif 0 O]rz 
a oe -31]0; 2 o 
Xs i 4 Bile 0 "xh 


Nth-order scalar differential equation with forcing terms involv- 
ing derivatives of forcing function u We shall next consider state space 
representation of a system described by Eq. (4-1). In such a system we 
should choose a set of variables such that in the state space the motion does 
not exhibit infinite jumps. Determination of a suitable state vector is pos- 
sible, since the derivatives of x and u provide all the necessary information 
for the determination of the future trajectory of such a system. The com- 
ponents of each possible state vector consist of suitable combinations of 
derivatives of various orders of x and u. Several methods are available for 


finding possible state vectors for such a case. We shall discuss three methods 
in this section and another method in Section 4-4. 


Method 1 We shall demonstrate the first method by use of 
a third-order system. Then, we shall extend the method to the nth-order 
system. 


Consider the following equation: 
X + aX + a,X + ax = bii + bi + bù + byu 
In terms of the operator p = d/dt, this equation can be written as 
(p° + a,p* + asp + as)x = (bp? + bip’ + bap + byu 
or 
P(x — bu) + p*(a,x — bu) + plax — bau) + (a,x — bu) = 0 
from which 


l 
x = bu + 3 (b,u — a,x) + $ (bu — ax) + JOu — A,X) 


Figure 4-1 Analog computer diagram for third-order differential equation X + až 
+ až + ax = bū + bü + bot + bzu. 


An analog computer diagram for this last equation is shown in Fig. 4-1. 
The output signals from the integrators constitute a set of state variables. 
Let us define the output signals from the integrators as —-x,, x,, and — x, as 
shown in Fig. 4-1. Then x,, x., and x, area set of state variables for the pres- 
ent system. In terms of the original variables x and u, the state variables 


X,, Xa and x, are given by 
x,—x— bu 
x, = p(x — bu) + (a,x — bu) (4-15) 
x, = p*(x — bu) + p(a,x — bu) + (a,x — bu) 
Note that 
px, = —(asx — bu) (4-16) 


In terms of matrix representation, the state vector x where 


x 
x= |% 
Xs 
can be written as 
Xi] 1, O -Ora —b, 0 "BENT 
Xie : SEDI — — 81) 
Xs à; od; UWE —b, —b, ile. 


Equations (4-15) and Eq. (4-16) may be modified into 
x, = xX — dou 
X, =X, + a,x — bu (4-17) 
X3 = X, + a,x — bu 
X3 = —a,x + b;u 
Eliminating x from Eq. (4-17) by use of the relationship x = x, + bu we 
obtain 


xX, = —4,X, + % =F (b, = a, by)u 


Xo = —a,X, + X; + (b, — abo)u 
X,— —a,x, + (b, — a,bj)u 
or 
Xi —a, 1 O][xj] fb, — a,b, 
xX,|2|—a, 0 1| x2| + |b. — asbo| [u] (4-18) 
Xs —a, 0 Olla ib —ab, 
where x, = x — byu 


(Notice that although the state vector x may be continuous, the original 
variable x may have discontinuities at points where the value of w jumps.) 
Equation (4-18) may be written as 


x = Cx + Hu (4-19) 
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By use of an appropriate transformation 
x= Py 
Eq. (4-19) can be modified into 
y = P-!'CPy + P-'Hu 


where P-'CP is in the Jordan canonical form; in particular if the eigenvalues 
of the characteristic equation are distinct, it is in the diagonal form. If 
necessary, the coefficient matrix of u can be normalized by use of an ad- 
ditional transformation 


y= Vz 


where V is defined by Eq. (4-6). Then the simplified equation of the distinct 
eigenvalue system is 


z= Dz 4+ Fu (4-20) 
where 
Ni 0 1 
x 2| 
p ; B=} * 
0 An 1 
The initial condition terms are transformed into 
z(0) = V-! P-! x(0) (4—21) 
where x(0) is given by 
[X(0) 1 © Ol[x(0) —b, 0 0 ] fu(O) 
x(0) = |x,(0)} = |a, 1 0||(0)|+41—b; —b, 0 | | (0) 
| x3(0) a, a, lj|x(0) —b, —b, —b,} uO) 
Example 4-2 Given the system equation 
X + 6X + 11% + 6x = ü + 8ü + 174 + 8u (4-22) 


obtain the state space representation in the form of Eq. (4-20). 


By use of the transformation defined by Eq. (4-15), Eq. (4-22) becomes as 
follows: 


X, —6 1 0 Xi 2 
X;|2|—11 0 1j||x 4| 6 |] (4-23) 
Xs —6 0 Ojlx 2 


The roots of the characteristic equation are —1, —2, and —3. Let us transform 
Eq. (4-23) into an equation in y by the following transformation: 


x — Py 
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where P is defined by matrix (3-21) and given by 


Jw. I 
P=|5 4 3 
6 3 2 
P-! is then obtained as 
—1 1 —1 
Po = -f 8 —4 2 
—9 3 —1 
Hence 
»] [231 9 Qf = 47/2 
yo|=| 0-2 0 E +| —4 2 —]1 6 | [u] 
Xs 0 9 —3 Js $i z 2 
—1 0 Olly —1 
=| 0 —2 0 | yo | + 2 | [ul (4-24) 
0 0 —3]JL»: 1 
By a further transformation 
y — Vz 
where 
—1 00 
V= 0 2 10 
00 1 - 


Eq. (4-24) can be simplified, giving 


P^ —1 0 07 fz: 1 
4 |= | 0 —2 UEEZSEZSES BU 
Z3 0 0 —3] Lz; 1 


The initial condition z(0) is defined by Eq. (4-21). 
For a given u, the response z can be computed easily. Once z is found, then 
the response x in the original variable can be obtained immediately. Since 


x — PVz 
1 1 i|[-—1 0 Olly =] 2 l][z 
=|5 4 3 | 02 Ojiz |=|—-5 8 3||Iz 
6 3 2 00 IJLz; —6 6 2]iz 


and 
x= X; + dou 
we obtain x in terms of 2), z;, z,, and u as follows: 
x= —2, +227, +23 +u 


We shall postpone general discussions of system responses, or solutions of state 
space equations to Chapter 6. 


Let us next consider a general case where the system differential equa- 
tion is given by the following equation: 


(n) (n-1) 


x4d- AX + +++ 4X4 ax 
(n) (n-1) 
= byu + bud bau bau (4-25) 
where the coefficients a; and b; are constants. Let us define a set of state 
variables x,, Xə, ... , x, in the following way: 
X, l 0 -... 0 x —b, 0 e 0 u 
Xs a, 1 0 x —b, —b, 0 ù 
= -— ° 
(n-1) ; (n-1) 
Xn Ok. ‘Oe os l x —b,., =r —b, u 
(4-26) 
which is equivalent to 
x,—x-—bu 
Xo =X, +a,x— bu 
X, — X, + a,x — bau 
(4-27) 
Xn = Xn-1 dg aX — bau 
X, = —a,x + bu 
Eliminating x from Eq. (4-27) by use of the relationship 
x=x,+ du (4-28) 
we obtain the following equation: 
zi —4, Ll 0 ss Q 26 b, — a,b, 
Xs —d, 0 1 0 Xe b, — a,b, 
=|. : " : . F — 5 [u] 
baer =—@,., Q 0, 9 Ix; b,., — An-1 Bo 
Ss — an G10) 2:5 © Xe b, — a,b, 
(4-29) 


where x, and x, the original variable of the equation, are related by Eq. (4-28). 


(1-1) 
The initial condition x(0) can be obtained in terms of x(0), X(0),....x (0) 
(1-1) 
and u(0), (0), . . ., (0) by use of Eq. (4-26). 
If the eigenvalues X4, Xe, ..., An of the n X n matrix in Eq. (4-29) are 
distinct, the system equation can be simplified by the use of the following 
transformation matrix P: 


1 1 
icd Ay + d, 
Das Ai + a + ds As + ae + ds 
AIC BAT + ees + Any Bot any tone + Qn; 
2 7 
An + d, | 


AÀ "b P "rr Aa 


i 


AR da XR 4 


(For the derivation of this transformation matrix, refer to Prob. A-3-10.) 
As an example, consider the system in Example 4-2. For this system 
since A, = —1, A. = —2, X, = —3, and a, = 6, a, = 11, a, = 6, we obtain 


1 1 1 
P= Ay + d, Ao + di As + d; 
MEH AA, + do Aida. d-4. M Ha + ds 


Eel d 
=s 4 3 
6 3 2 


This is the same P matrix as obtained in Example 4-2. If the eigenvalues of 
the n x n matrix in Eq. (4-29) involve multiple eigenvalues X, of multiplicity 
m, then the transformation matrix becomes as follows: The elements in the 
jth column (2 <j «i m) is (j — 1)th derivative (with respect to X,) of the 
corresponding elements in the first column divided by (j — 1)!. For example, 
if the first three eigenvalues are equal and other eigenvalues are different 
from each other, then P becomes as follows: 
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F 1 0 0 I 
^, + Qa, 1 0 As + a; 
NE + aA, + a 2r, + a, 1 Aic QA. +a, ce 
| 8104) 3A + 2a,rA, + a; 34, + a, £304) 
pa j à T : 
d 1 d 
En-2(à1) dx, 82-2) OT dx 8:0) Sn-2(Au) 
d 1 d? 
L £a-1Qu) dx, 8:0) 21 dxi 8:09) £a-104) 
1 
Xn = a, 
Ma + dis + a 
EAn) 
En-2(An) 
Bn-1(n) 
where 


Sn-i(A) =A) + aA"? Tx Qn + 04i 
(For the case of multiple eigenvalue systems refer to Prob. A-3-11.) 


Method 2 We shall begin by demonstrating the second 
method by use of a third-order system. Consider the following equation: 


X+a¥+a,X+a,x=bit+b,i+ bú + bu (4-30) 
We wish to transform Eq. (4-30) into the following form: 
X 0 1 01 fx, ĉi 

| = 0 0 1 |} x| + | c, | [uJ (4-31) 
X3 —à, —4aà, —a,jix C3 


This is possible if we define 
ži =X, + cu 
X, = x, + c,u (4-32) 


Žž; = —d,X, — A,X, — A,X; + C;u 
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and 
x= x, + cou (4-33) 


The problem then becomes that of determining constants c, ¢,, C2» and c, in 
terms of the a's and b's. From Eq. (4-32) and Eq. (4-33), we obtain for x 
X=X, + Cot + cu (4-34) 
Similarly for X 
X = x, + ct + cuu + cu (4-35) 
and for X 
X = —a,Xx, — A,X, — A,X; + col + c ü + Cou + CU (4—36) 
Multiplying Eq. (4-35) by a,, Eq. (4-34) by a,, and Eq. (4-33) by a; and then 
adding these to Eq. (4-36), we obtain 
X¥+a,X+ a,x + a,x 
= —4,X, — A,X, — A,X; + coi + C Üü + Cú + c3u 
+ a,X3 + a, Cott + ac, + a,Cou 
+ A,X, + AgCott + a,C,U + AX, + A3Cou 
= Coil + (c, + ,Co)it + (e + a,c, + a5 09) 
+ (es + 4,0, + Age, + a5 69)u 
= bü + bit + bu + bzu 
Hence 
(Co — by) + (Cy + 1 Co — bi) + (es + A, 0; + Gy Co — b) 
+ (C3 + a,0, + a,c, + a3Co — b;)u = 0 
In order to determine co, c;, Ca, and c;, let us equate coefficients of ü, ü, ú, 
and u of this equation to zero, respectively. Then 
c; — b, —0 
c, + aj, —b,=0 
€, + a,c; + a,c, — b, = 0 
€; + a,c, + a,c, + a;c, — b = 0 
it follows that 
€, = b, 
c, = b, —a,b 
1 1 1%0 (4-37) 
€, = b, — a,b, — a,(b, — a,b) 
€; = b, — a;b, — (a, — ai)(b, — a,b) — a,(b, — abo) 
Hence we see that Eq. (4-30) can be transformed into Eq. (4-31) by means of 


the transformation given by Eq. (4-32), where c,, c,, and c, are defined by 
Eq. (4-37). The initial conditions are 


x,(0) = x(0) — c,u(0) 
x4(0) = x(0) — c,u(0) — c,u(0) (4-38) 
x40) = X(0) — ey i(0) — c u(0) — c,u(0) 


Note that a set of state variables obtainable by Method 1 and that 
obtainable by Method 2 are different. In the case of the third-order system 
the set of state variables obtained by Method 2 can be written as follows: 


X,=xX— eu 
X, = X— c8 — Cu 
X; = X — Coli — c, — Cou 
In terms of the coefficients a; and bj, we have 
x, = x — bu 
X, = p(x — byu) + (a,b, — b,)u (4-39) 
x, = p'(x — b,u) + p(a,b, — b,)u — a (a bs — b,)u + (aabo — ba)u 
where p = d/dt. [Compare Eq. (4-39) with Eq. (4-15). The two sets of 
equations are different from each other. As a matter of course, however, 
both sets of variables x,, x., and x, are valid sets of state variables for 
the system considered.] 
Example 4-3 Given the following system: 
X + 6X + 11x + 6x = iü + 84 + 174 + 8u 


obtain the state space equation in the form of Eq. (4-20). 


By use of Eq. (4-39) the given equation can be transformed into the following 
form: ; 


ES 0 1 fx 7] 
X. |= 0 0 li|[xe |+| —6} {uJ (4-40) 
Xs —6 —11 —6||»x ] | 
By means of the transformation 
x; 1 1 iir- 0 irz 
x,|2|—1 —2 —3 0 2 O]) 2. (4-41) 
Xs 1 4 9 0 0 1)L2; 
Eq. (4-40) can be transformed into 
E -1 0 07 fz 1 
2, |= 0 —2 Ojlzes |+| 1 | [a] 


23 0 0 —3 I 


The initial condition z(0) can be obtained from Eq. (4-37), Eq. (4-38), and 
Eq. (4-41). The solution x to the original differential equation is then obtained as 


x= Xx + bou = —z, +22. +23 +u 


t 
w 


Example 4-4 Let us obtain a state space equation of the following 
system: 


d SEL 
(p 4- D(p + Dp + 3x = (p + Du, pe (4-42) 


By letting 
x=xX 
X, = Xs + Cu 
X, = X; + Cou 
Xs —6x, — 11x: — 6X; + esu 


the given differential equation can be written as follows: 


6 0 1 01[x €i 
Xslbel 39 0 1il x |+| c| {4 
Xs —6 —1n1 —6] Lx: Cs 
Constants cı, c;, and c, are obtained as 
e; = 0, = 1, Cs = —5 
Hence we obtain 
Xi 0 1 Osi 0 
X. |= 0 0 1 s + 1 | [uJ 
Xs —6 -11 —6] lxs —5 
which can be simplified into 
dr —1 0 0 yı 0 
Ja |= 0 —2 oli». || 1 | Ud (4-43) 
l| Lo o -3lLx 1 
where x and y are related by 
xi 1 1 npo ofai 
x |-|-1 -2 -3||0 1 0||» 
Xs 1 4 oilo 0 =i] Ls 


Equation (4-43) indicates that the motion of y, does not depend on u. mn 
be clear, since in Eq. (4—42) the factor (p + 1) appears on both sides O abs 
equation. (Notice that if we cancel the terms (p + 1) on both sides of Eq. ( : ) 
then the order of the equation becomes second which is different from x order 
of the original system. Since the original equation is given as à jen te — 
tion, the corresponding state space equation must consist of three first-orde 
differential equations.) 

We can easily extend the present analysis to a general case of the nth- 


order system: 
(n) (n-1) n) n 


( D 
X C= Sect + be u + biu 
X + ax -e + aniž + dax bou + bu + 1 
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By defining the follgwing set of equations: 
X, =X, + cu 


XQ = X3 + Cou 


Xa-i = Xn + Cn-,U 
Xn = —44XQ; — An- X; — +--+ — A Xn + Cnu 
and 
x= x, + cu 
we can transform Eq. (4-44) into the following form: 


x, 0 1 0 =. O7f x e 
Kg 0 0 1 ee 0 Xs ts 
= P ‘ A : P SEE er [u] 
Xe 0 0 0 1 SER C3 
$t Gy iai —Gng c —G,|| x Le, 
(4-45) 
with initial conditions 
x,(0) = x(0) — c,u(0) 
x,(0) = x(0) — c,2(0) — c,u(0) 
x,(0) = ¥(0) — c,(0) — c, (0) — c,u(0) 
(n-1) (n-1) 
X«(0) = x(0) — cou (0) — +++ — c,.,3(0) — cn- u(0) 
where Cy. Ci, Cy, ..., c, are determined from 
€, = b, 
€, + a ey — b, 
€, + a0, 4- a0, — b, 
(4-46) 


Cs + 4,0, + A.C, + d4€, — b, 


Ca t+ GC, H ee QAn-1C; F Q4C, = b, 


Equation (4-45) can be simplified, if necessary, by the straightforward 
technique discussed in this section. 


Method 3 We shall demonstrate the third method again by 
use of a third-order system. The result can be easily extended to the nth- 
order system. 
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Consider the following equation: 
ï+ aX + a,X--a,x = biti + bü + b, + bu (4-47) 


We shall show that one possible state space representation of this system is 
given by 


" 0 1 wira 01 
Alel d 5€ ile 4 [u] (4-48) 
x —a; —a, —a,||X; 1 


where the original dependent variable x is given by 
x = (b, — a,by)x, + (ba — azbo)X, + (b, — a, bo)xs + bou 


To demonstrate this possibility we shall consider an analog computer 
simulation of the original system given by Eq. (4-47). Suppose we define 
E " ee 
u dE ME did 3y (4-49) 
x = b. by by + by 


Then elimination of y and its derivatives from Eq. (4-49) results in Eq. 
(4-47). Figure 4-2 shows realization of Eq. (4-47) or Eq. (4-49). Let us define 


Figure 4-2 Analog computer diagram for third-order system defined by Eq. (4-47) 
or Ea. (4-49). 


MX 
y-2x (4-50) 
jx 
Then from Eq. (4-49) and Eq. (4-50), we obtain 
Meme 
Xy SX 
X3 = —ad,X,— dX, — a,x, + u 


and 
x = (b, — ajby)x, + (by — a,b), + (b, — a,b.)x3 + bou 
Hence Eq. (4-48) results. As seen from Eq. (4-48) essentially we have 
modified the original equation (4-47) into 
Vrayt+aytay=u 
The original variable x and the new variable y are related by 
by + bY + by + by =x 


Equation (4-48) can be simplified by techniques presented in this section. 
Note that the analog computer diagram shown in Fig. 4-2 represents also 
a simulation of the following transfer function: 


X(s) _ bas? + bis? + bas + b, 
Uls) s +as+a,s+a, 
provided that the initial conditions x,(0), x2(0), and x,(0) are set equal to zero. 


Extension of the present method to the nth-order System is straight- 
forward; therefore, details are left to the reader. 


Linear time-varying system In presenting the preceding three 
methods we limited our systems to linear time-invariant systems. These 
methods, however, can be extended to treat linear time-varying systems. 
Since extension of the methods to linear time-varying systems is straight- 
forward, we do not discuss this matter in detail. We present only the 
extension of Method 2 to linear time-varying systems. 

Consider the foliowing system: 


Bache ^ Sends es Bel B VE cas ob btu (4-51) 


where the a,(t) and b,(t) are differentiable as many times as we need. By 
defining the following set of equations: 


xX, = X, + ¢,(Hu 


X, = X; + ¢,(t)u 


Xn-1 = Xn Ca. (f)u 


Xn = —a,(t)x, — a, .(t)xs — --- — a(t)Xn-1 — a (t)Xn + {tu 
and 
x= xX, + cu (4-52) 
we can transform Eq. (4-51) into the following form: 
Xi 0 1 0 es 0 xi [| e(t) 
Xs 0 0 l e 0 Xs ex) 
= ; ; ; ; ; du ; [u] 
es 0 0 © we i xal | aC) 
Xn —4,(t) —@n-\(t) —an-(f) +++ —a,(t) Xn |. L eat) |j 
(4-53) 
The functions c(t), c(t), us S c,(t), which are combinations of the coef- 


ficients a,(t), b(t), and their derivatives can be determined by successively 
eliminating x,, Xə, ..., Xn from Eq. (4-52) and Eq. (4-53) and requiring 
that the resulting rth-order scalar differential equation be identical with 
Eq. (4-51). For example, if n = 2 we obtain 


e«t) = b(t) 
c(t) = b(t) — a(t) b(t) — 2b,(t) 
e«t) = ba(t) — a(t )bo(t) — a,(t)[b,(t) — ailt b(t) — 2540] 
— b(t) + à t)b(t) + b(t) 
Similarly, the c,(t) for the general case can be obtained as follows: 
Cot) = b(t) 


SRE bid Pn ante a, i (Det) 


Notice that because of the presence of derivatives of a,(t) and 5;(t), the ex- 
pressions for the state variables x,(t), x(t), .. . , Xa(t) become complicated 
except in simple cases where n is small. In the case where n = 2 we have 


x, x — e«t)u 
X, = X — C(t )ú — c(t)u — é,(t)u 
In terms of the a,(t) and 5,(t), x, and x, are obtained as follows: 
x,—x-— bt)u 
X, = x — by(t)u — b,(t)u + a,(t)bo(t)u + ba(t )u 
As in the case of linear time-invariant systems, different sets of state variables 
can be obtained if different methods are used for deriving possible sets of 
state variables. 
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4-39 TRANSFORMING STATE SPACE EQUATIONS 
INTO SCALAR DIFFERENTIAL EQUATIONS 


In this section we shall present techniques for deriving scalar differential 
equations which correspond to given state space equations. We shall show 
that for a given state space equation or vector matrix differential equation 


X = Ax + Bu (4-54) 


where x is an 7 vector, u is a scalar, A is an n X n constant matrix, and B 
is an n X 1 constant matrix (an n vector), there corresponds a number of 
expressions of the form: 

(n) (n-1) " (n) (n-1) 

Y+ay + -ee + ara) + any = but biu + --- + br-iù + bnt 


(4-55) 


Notice, however, that in each different expression the variable y is different. 

In the following we shall first demonstrate that a state space equation 
described by Eq. (4-54) can be transformed into an equation of the form 
given by Eq. (4-55). Then we shall show that by a suitable choice of the 
variable y Eq. (4-54) may be transformed into 


(n) (n-1) 
YEY F E Faf + any =u 


9 Li L * n) (n-1) 
Deriving scalar differential equation of the form y + a,y +--- 
À g (n) (n-1) 
+ ds + ay = bu+bu+--- + b, uu + 6,u from state space equation 


of the form X = Ax + Bu In order to derive a scalar differential equation 
which corresponds to a given state space equation let us first note that if 
the eigenvalues of A are distinct, or if the multiplicity of any of the multiple 
eigenvectors of A is the same as that of the corresponding multiple eigen- 
value of A, then there exists a nonsingular matrix T such that x — Ty will 
transform Eq. (4-54) into 


y = T^'ATy + T^! Bu 


where 
0 1 0 0 0 
0 0 1 0 0 
TAT = 
0 0 0 0 1 
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and 


Cn-1 


Cn 


The constants a, can be determined from knowledge of the eigenvalues of 
A. Once the c, are determined by computing T-'B we can determine the b; 
from Eq. (4-46). i 

We now give two simple examples to illustrate the procedure for deriving 
corresponding scalar differential equations from state space equations of the 
form given by Eq. (4-54). 

Example 4-5 Consider the system shown in Fig. 4-3. The system 

can be described by 


x = Ax + Bu 
where 
2; —3 —2 -1 1 
x =| x. |, A= 1 0 —1|, B=/| 0 
Xs 0 0 —4 0 


Define y such that it is a linear function of x,, X», and x3, or 
xi 
y=[a b |x 
Xs 
where a, b, and c are constants. Derive a scalar differential equation in y describing 


the system dynamics. (Note that y is not a unique combination of x;, x», and x;. 
Note also that some, but not all, of the constants a, b, and c may be zero.) 


ži =- 3X- 2X2-X%3+U 


X27 X- X 


X37 -4 Xs 


Figure 4-3 Control system. 


We shall obtain the system's differential equation in the following form: 
Vrayrayt+ay= boit + by ii + bd + bu 


Notice that 
—-3-xrx —2 —1 
1 7 ENS! = —(A3 + 7A? + 14X + 8) 
0 0 —4—2A 


= —(A + 1)(X + 2). + 4) 
Let us define y as the first component of the following y: 


y = Tx 
or 
A 4 B «px; 
ye ema ves Ws 
Vs Ske i) ti 
where T-! should be chosen to be nonsingular. We determine T-! such that 
» 0 1 0 Vi €, 
2 |= 0 0 1 Xs |+| ce | [u] 
Js -8 —14 —7|l» C3 
where c;, co, and €; are constants. To determine T-!, we solve 
a b ¢)f—3 —2. —] 0 1 Olfa b c 
def 1 0 -1l/= 0 0 Hig e f 
g hi 0 0 —4 —8 —14 —7]lege h i 


for a, b, c, d, e, f, g, h, and i. Note that T- is not uniquely determined. One possi- 
ble solution (a nonsingular matrix T~?) is given by 


0 1 1 
T- = 1 0 —5 
—3 —2 19 
For this T-!, constants C1, Co, and c, are determined as ` 
[^ 0 1 1 1 0 
Cc. |= 1 0 —5|10|-2 1 
C3 =3 =2 > 19: 10 -3 
Noting that c, = 0 because Y = yy, we obtain from Eq. (4-46) 
b, =e =0 


bi=c + ac, =0 
b. = Cz + a,c; + aC = 
bs = C3 + a,c + Ane, + azco = 4 
Hence the desired scalar differential equation in y, where y = y, = x, + x,, is 
YV+7H+149+8y=un4+4u 


or 
d 
PUFFI RO= pF. pes 
Example 4-6 Given the following state space equation: 
X = Ax + Bu (4-56) 
where 
X; 0 1 0 ‘eal 
x=| xXx], A= 3 Que 3. B-| 2 
Xs —12 —7 —6 t. 84 


obtain the corresponding scalar differential equation of the form 

y c dy c ay c ay = by + bü + bet + bzu 

The three eigenvalues of A are —1, —2, and —3. We can determine a,, as, and 

a; from the following equation: 

AS + ad? H-agX +a, =(A+ 1)(A + 2)(r + 3) 
The values of a,, a», and a, are found to be 

a, = 6, a = 11, a; = 6 

By a suitable nonsingular transformation x = Ty, Eq. (4-56) can be transformed 
into 


y = T-ATy + T-'Bu (4-57) 
where 
0 1 07 
~TAAT=| 0 0 1 
zd =H 5] 


The matrix T-! can be determined by the method discussed in Example 4-5. Here 
we shall, however, find T-! by a different method. Let J be the Jordan canonical 
form of A. Then there exists a nonsingular matrix P, such that Pī! AP, = J. Note 
also that there exists a nonsingular matrix P, such that P;'(T-! AT)P, = J. 
Hence, P;'AP, = P;'T-'ATP,, or 


A = P,P;'T-ATP,P;! = (TP.P;)-'A(TP.P;) 


Therefore, we can obtain T-! from 
T^! = P,P;! 
where 
; = matrix defined by Eq. (3-18) or matrix (3-21) 
P, — matrix defined by Eq. (3-37) 
(For multiple eigenvalue systems we must make necessary modifications to P, and 
P,. Notice that a nonsingular matrix T does not exist unless the multiplicity of 
any of the multiple eigenvectors of A is the same as that of the corresponding 
multiple eigenvalue of A.) 
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For the present system P, and Py! are (Refer to Prob. A-3-1.): 


1 1 1 $ $ 1 
p=- -2 —3|, Př=|—3 —2 -1 
DEN 4 9 $ 3 1 
Equation (4-57) can now be written as 
Vi 0 1 0 Ty 3 
ži |= 0 0 1 ys | T | —6 | [u] 
Js —6 —11 —6] L»; 19 


Note that y = y, + cot. By use of Eq. (4-46) we obtain 
by = Co 
b, = 64 + ara = 3 + 66 
bs = cs + a410, + asc, = 12 + 11e, 
b, = C3 + aC + gC, + azco = 16 + 6c 


Therefore, a scalar differential equation equivalent to Eq. (4-56) is given by 
pr Opt 11y + 6y = coü + (3 + 6c)ü + (12 + 11) + (16 + 6c)u 
or 
d? ; d? d x ; 
ap O — cou) + 6^5 0 — Cy) + Ua (y — cou) + 6(y — cou) = 3 + 124 + 16u 
Here, c, is an arbitrary constant. In terms of y,, this scalar differential equation 
becomes 
Yi + OF, + 114 + 6y, = 3ü + 1240 +16u 
Notice that y, and x are related by the following equation: 
y, = 4x, + 2x2 + X; 

If we choose a different quantity as the dependent variable in the scalar differential 


equation, then the resulting scalar differential equation is, of course, different. In 
fact, if we make the following transformation: 


X —i 2 31 Cz, 
4| = 2 3 —34|| z 
x] 3 -0- Uz 
then the scalar differential equation (in terms of z,) corresponding to Eq. (4-56) 
becomes 
Z; 4-62, + 112; + 625, =u 


(Refer to Prob. A-4-10.) This is the simplest form of the scalar differential equation 
we can obtain for the present state space equation. Such a transformation into the 
simplest form may be made if we make a suitable choice of the dependent variable 
of the scalar differential equation. 


In the following we shall discuss a method for transforming a state 
space equation into the simplest scalar differential equation. 
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n (n-1) 
Deriving scalar differential equation of the form y T ay + 
«++ + daa + any = u from state space equation of the form x = Ax 
+ Bu (4-1) Consider the system 
x = Ax + Bu (4-58) 
where x is an n vector, u is a scalar, Aisann X n constant matrix, and B 
is an n X 1 constant matrix. We shall show that by means of a suitable 
transformation 
x — Hy (H: nonsingular) 


we may be able to transform Eq. (4-58) into the following form: 


ý = H^AHy + H^! Bu (4-53) 
where 
0 O0 sss © =f; l 
10 = 0 —an-ı 0 
soe e 0 
Homel 5 m 9 maa aee 
0 0 ] —a 0 
and 
—4, = (h,, A” B) 


—a,., = (hy, A*B 
an-ı = (hy ) (4-60) 
—a, = (h,, A” B) 
In Eq. (4-60) h; is the ith row of H^' and (h,, A"B) denotes the inner product 
of h; and A"B. By eliminating y,, s -+ + s Yn-ı from Eq. (4-59) we obtain 
n n-1) (n-2) . 
Ya + ay. + ayn + +t F dais + days =U (4-61) 
It can be saown that such a transformation matrix H is given by a matrix 
consisting of column vectors B, AB, A* B, o 4,57 B, or 


H = [B ! AB | AB i --- ; A™'B] 
provided n column vectors are linearly independent. (See Prob. A-4-9.) 
Notice that if we define 


Zi Yn 
Z; Va 
z= = 
(n=1) 
Za Yn 


then Eq. (4-59) can be transformed into | 
B 0 l Ü «ws 8 


£i 0 
A 0 0 E = EU 0 | 
= + [u] 
Zu 0 0 0 see 1 Zn=1 0 
Zi —a, —flicy —An-» ia —a, Zn l 


(4-62) 


This means that Eq. (4-58) may be transformed into Eq. (4-62), or its 
equivalence 


(n) (n-1) 


Zi + iZi tee) + anah + dQZ,; =U 


Example 4-7 Consider the following system: 


x —6 1 0 Xi 2 
žə |=| —11 0 1 Xa |+| 6 | [u] 
Xs —6 0 OJLx,; 2 


We shall illustrate that this state space equation can be transformed into the scalar 
differential equation 


Vraytayt ay =u 
Since for this system 


P —6] 16 
B-—|6|, AB — EJ A°B =| 54 
Lg —12 36 
we obtain 
2  —6 16 
H-—|6 —20 54 
2 —12 36 
Let 
X — Hy 


then we obtain 


y = H-'AHy + H-!Bu 
where 
00 -6 
HAH =|] 0 —11 
D 1 —6 


and 
1 
9 -& x2 1 
= 27 kj 
HB = > —$5 5 6 |=| 0 
$ 1 
+ crm [e 
Rewriting 
Vy 00 —6 JA 1 
yist O -H ys |+} 0 | [a] 
Jz 0 1 —6 Js 0 |! 


Elimination of y, and y, from the foregoing equation results in 
Js 69. + 101», + 6y; = 
where y, is related to x,, x», and x; by the following equation: 


ys = 4x, — 35, + ls 


2 2 


4-4 PARTIAL-FRACTION EXPANSION TECHNIQUE 
(4-2, 4-3) 


The partial-fraction expansion technique can be used for obtaining state 
space representation of transfer functions or scalar differential equations. 
By this technique we can obtain directly the state space equation in which 
the coefficient matrix of the state vector is in the Jordan canonical form. In 
applying the partial-fraction expansion technique to the transfer function, it 
is necessary to factor the denominator. 

The procedure for obtaining a state space equation for the differential 


equation 
(n) (n=1) (n) (1-1) 
X+ 4X + +++ + aniž + anx = byu + byu + 


L 


4b aud bu 


by the partial-fraction expansion technique is to assume that the initial 
conditions are zero and write the corresponding transfer function. The 
assumption of zero initial conditions is permitted because we are dealing with 
linear systems, and state space representation of linear differential systems 
does not depend on the initial conditions. A state space equation for the 
differential equation with the zero initial conditions (or the corresponding 
transfer function) is also a state space equation for the differential equation 
with the nonzero initial conditions. 

In this book, therefore, whenever we use the partial-fraction expansion 
technique we first assume the zero initial conditions in the differential 
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equation and obtain the transfer function. Then we derive the corresponding 
state space equation. Finally, we relax the zero initial conditions to arbitrary 
initial conditions. 


System with simple forcing function We shall first consider a 
simple case where Eq. (4-63) assumes the following form: 
Q) — (n=1) . 
X--a xa 4X4 a,x = bu 
The transfer function corresponding to this equation is 


A o ae 
U(s) (S — XS — Ne) > =e (S — Xa) 


We shall transform this transfer function into a first-order vector matrix 
differential equation, or a state space equation. We shall first consider 
the transfer function with distinct poles and then consider that with multiple 
poles. 


State space representation of transfér function with distinct 
poles Let us denote the transfer function as W(s) or 


X(s) b 
ld c ——— o ——— 
(s) U(s) s tas" +--+ +a,_,;5+ an 
b 
a 4-64 
B—RE-—A p ids 
We assume here that the poles A,, A», ..., An are distinct. The original 


scalar differential equation from which Eq. (4-64) is obtained can be repre- 
sented in a state space by Eq. (4-8) or Eq. (4-9). If each of Eq. (4-9) is 
Laplace transformed assuming zero initial conditions, then we obtain using 


variables Pis ys, . . . , Yn rather than variables Zi; z;, ... , Zn 
Fus) . 4 
U(s) s—A 
YX45). 1 
U(s) s—2A (4-65) 
Y,.s 1l 
U(s) s—X 


Hence we see that a simplified! state space representation of Eq. (4-64) is 


Here, simplified state space representation means that, in the state space equation, the 
coefficient matrix of the state vector is in the Jordan canonical form (which includes the 
diagonal form as a special case) and the elements of the coefficient matrix of the forcing 
function u are either unity or zero. 
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essentially a representation of the transfer function by n simultaneous first- 
order transfer functions which can be obtained by dividing Eq. (4-64) into 
n first-order systems, as shown in Fig. 4-4. Since Eq. (4-64) and its state 
space representation, Eq. (4-65), are equivalent, the former can be derived 
from the latter, and vice versa. Essentially, Eq. (4-64) and Eq. (4-65) are 
related by a transformation matrix P. We can find the necessary transfor- 
mation matrix P by expanding the transfer function into partial-fractions. 


b 
TETTE 


Figure 4-4 An nth-order transfer function and its representation by a sum of n first-order 
transfer functions. 


Let us define: 


X,(s) = X(s) 
X,(s) = sX(s) 


X(s) e S77 XS) 


If the following transformation is made 


X(s) = PY(s) 

then 
X(s)  pY(s) 4-66 
U^ PS O (4-66) 


Since 


It is seen that each element of P is obtained directly from the coefücients of 


e = Ws) the partial-fraction expansions of W(s), sW(s), ..., s"^'* W(s). The simplified 
T state space representation of Eq. (4-64) then becomes 
US = sW(s) y = P APy + Fu 
6) where 
"EN: 0 1 0 17 
X) | on- 
Uo 99 0 0 0 1 
if each equation is expanded into partial-fractions, then A= , E ; 
Xs) zs d Lo Pre | Pin s ; 
U(s) s—X 's—x, "gem m 0 0 l 
TG. Bog Po. + Pen 7, Tian x: J 
Ui) dM  F= 3," $—3 (4-67) em 
iai . ^ A, 0 0 JA 
Xs) PET Pa» p. Ü -—X 0 
= n n E xg nn X» J: 
U(s) i-% e TERN - à + [a] 
Or Pe 
X,(s) " l : . Li LI Li 
U(s) Dui Pu ^o Pin SeN Jn 0 0 D Nal Ln L 
X,(s) 1 It is noted that the preceding form is the same regardless of the value of b in 
U(s) |. Po Po Ps | | $— X: Eq. (4-64). (The elements of P, however, depend on the value of 5.) 
: Ts z If the first column of the matrix P given by Eq. (4-69) is divided by p,,, 
Á . the second column by p;,, ..., the last column by p,n, then the transfor- 
X,(s) l mation matrix P becomes P,, given by 
U(s) Pn: Pas Pnn pme 1 ] «0 f 
Y.(s Po Pe... Pan 
Pu Pi Pin ae Pu Pie Pin 
Ys Pu Pu... Pm 
E Poy Do» Pon uS P, = Pur Pris Din 
(4-68) pe 
Y.) Du Ds Pan 
Pny Pre Pnn U(s) Pu Pre Pir 
From Eq. (4-68) the transformation matrix P is obtained as This matrix must be equal to 
l 1 es] 
Pu Pr Pin " " " 
P» Pee Pon 1 2 n 
. e. A Ai zy X 
P= : (4-69) Bun 
Dni Pno d ADU x — E 
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Hence 
Pi = Xs Pz — " à Pan _ Xa 
Pu Di? Din 


or in a general form 
Py pM? (4-70) 


Thus, once p,,. pis, - - -> Pin are evaluated, P31, Poo, . . . , Pan can be obtained 
from Eq. (4-70). [Notice that Eq. (4-70) applies only to the case of distinct 
eigenvalues.] 


Example 4-8 Consider the following transfer function: 
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If we make the transformation x — Py, or 


1 
xy 2 


1 
1 
X, |=| —3 2 —4 ||): (4-71) 
P » -=g i V3 
then 

Y,(s) 1 

' U(s) f l 
Y,(s) 1 

U(s) s+2 
Y,(s) 1 

U(s) 5+ 3 


X(s) _ 


1 


UG) — “) = GEHEFDGED 


By use of the partial-fraction expansion technique obtain a state space representa- 
tion of this transfer function. 


Since 
Xs) _ t 1 
Us) - "9-7 6Gcr0DGcT2673) 
Xs) s 
2 C wW = 
Us) :"9-75rDGc-26T3 
LEONE = s: 
U)-:"G-76cDGT26GT3 
the partial-fraction expansions of these three equations become 
X5) U ot UR 
UO s-1*:s0127*543 
A i, -- a 
UD sl? Seo 32s 
X() 12 , —4 , 9n 
U() 3-1'5432^543 
or 
X,(s) i a ara 
Us) 2 2||s1 
MEL. in w cm d 
Us) 2 en 
X;(s) zb =á = 1 
U(s) 2 elles 
Thus 


Hence the given transfer function can be transformed into a set of threesimultaneous 
equations as follows: 


ý= —yı +u 
Jo = —2y, + u (4-72) 
Vs = —3y + u 


This is the simplified state space representation of the given transfer function. 
Here the y; and x; (i = 1,2, 3) are related by Eq. (4-71) and x, = x, X; = %, 
x, = X. The response x(t) is clearly given by 


x(t) = Ente) — yalt) + Lyslt) 
where y,(¢), y(t), and y,(¢) can be easily obtained from Eq. (4-72) once the forcing 
function u is specified. The initial condition terms are given by 
y(0) = P-'x(0) 
Notice that Eq. (4-72) is also a state space representation of the following dif- 
ferential equatior.: 


(p +D +2)(p +3)x =u, pg 


or 
X+ 6ž+1lž+6x=u 


State space representation of transfer function with multiple 
poles Next, consider the case where some poles of the transfer function 
are multiple poles. Consider the following transfer function: 


X(s) _ II CNN MM" 
U(s) s+ as ee) + anas + d 


Eh. b 
Ee =n eS) (773) 


This transfer function has double poles. The simplified state space equations 


corresponding to the double poles should be in the form 


Yi —»Syict)y 
Yo = My cu 
from which 
Fs) dl 
Y(s)) s—A, 
This suggests that Y(s)/U(s) be defined by 
Y,(s) _ 1 
U(s) (s— N) 
Y,(s) = 1 
U(s) s—h, 
Fs) — 1 
U(s) "gl S c Ns 
Eus) l 
U(s) z $ An 
Let X,(s) = X(s), X.(s) = sX(s), ..., Xp(s) = s"^! X(s). If 
Xs) _ Xs) as AaS) ae 
wo W(s), U(s) sW(s), Ts) > st) W(s) 
are expanded into partial-fractions, then 
Xs). Pu 4 Pis ra (2 3L ae n D. 
U(s)) (s—Xy s- S—A; © (0$ — Xn 
Xs) _ P21 D»» Die. xL. Dum 
U(s) ~ GAP TERE Em Nga 
X«G) _ ee. cee ee 2) ae ee ee ve de ok 
U(s)  (s—Xy s—X SA | §—Xn 
Let 
Pu Pie Pin 
Po Pa Pon 
P, = ; 
Pu Png *** Pm 
By dividing the second column of P, by a constant factor 
b í 


Or zx M) Ar e A4) dad A, aie An) 
and by dividing the first column, third column, ... 
Diis +++ Pin, respectively, P, can be modified into P, given by: 


(4—74) 


(4-75) 


(4-76) 


, ath column by p, 


1 —rL 1 eee 1 


Ai 1—A,L Ne =e Ne 
Pose X 2A, — ML P RED M 
AS GHIA ah apt e AD 
where , 
atd 1 UNT HN M 
Beg ECL Ay — An 
Then A 
0 
SE TEE S 
—A2- or —— i 
where P is defined by Eq. (3-50). Also, Py’ is obtained as 
LA, LA, ,., LA^ 
|P| IPI |P| 
0 0 0 
= Po! + Q " 0 
0 
(Refer to Prob. A-4-4.) Thus, 
PAP) = P'AP 
LA, LA, .. LA, 0 =L 
[et e |P| 
0 ee 0 0 —AL 
$ j : . APTP A 2 
0 0 0 0 —Ar'L 
LA LA... LAS je -6 o :9 
HE HM |P| : 
E 0 Ò =AL + 
Aoa Ze 
0 0 0 0 ATE 0 


(4-77) 


(4-78) 
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where 
0 1 0 
0 0 0 
A = 
0 0 ys l 
—aA, =ar Sh ecd 


The second and third terms of the right-hand side of Eq. (4-78) will cancel 
each other and the last term of the right-hand side of Eq. (4-78) is zero. 
(Refer to Prob. A-4-5.) Thus, we find Py! AP, = P-! AP. This implies that 


PAP =P CAP 
This means that as a transformation matrix P we may use either Eq. (3-50) 
or Eq. (4-76) or Eq. (4-77). 
The simplified state space equation of the system with double eigenvalues 
is given by 


» A. i B 0 Yı [0 
Ve 0 X 0 0 ye 1 


F 
o 
o 
ae 
o 
= 


P über. * Lp à; [u] — (4-79) 


Pn Ü € B xe Re 1 
This result is the same for any value of b in Eq. (4-73). 
Similar results can be obtained for the case where the transfer function 
involves multiple poles of multiplicity m. In this case Y(s)/U(s) are defined by 
Ys) 1 
U(s) (s—AX)" 


Ys) — 1 
i k- — À ii 2 


YAD 1 z 
UO a N (4-80) 


Y.(s) _ 1 
U(s) r^ d — Am+1 


F5)... 1 
U(s) s—Xs 
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Notice that 


Yi) 2E 1 Y.(s) ME 1 Y, i5) = 1 
YS) s— A, KO Uem XS) $—AX, 
The simplified state space equation is 
Vn AS d. Qo 4 Oi 0 Jı 0 
Ve 0 à ! ees d : Ja 0 
Ps 6 OU À gi m 0 
$&] [e 0 & ee X: x. 171118 
Be ee CE Dt DUE REM 4-81 
nii : Am+1 0 Ym+ı 1 ( ) 
In 0 0) Medd. | 


The transformation matrix P can be obtained from the coefficients of partial- 
fraction expansions of 


Xs) Xy) X.) 

U(s) ' U(s)' “°°? U(s) 
It is similar to that given by Eq. (4-76), where the first m elements p;,, Pio, 
-> Pim (i = 1,2, 3, . . . , n) in each row correspond to the m multiple poles. 


Example 4-9 Consider the following transfer function: 
X(s) 5 


UG) ~ "9-765 T1 6GT2 
Obtain a state space equation for this transfer function. 
Since 
Xs) 5 —5 5 
UG) EFP TFI F 
Ke 5 10 —10 
UO CFD BIO 42 
Xs) _ 5 € — —15 20 
Us) (sc-1)P'scl sc2 


the transformation matrix P, defined by Eq. (4-76) is obtained as 


5 —5 5 
ZE 10 —10 (4-82) 


Z =i 20 


By means of the transformation x = P, y the given transfer function can be trans- 
formed into the following simplified state space equation: 


» =] 1 07 Vi 0 
Hil 0 —1 0 | BEN [u] (4-83) 
E 0 0 —2]L» 1 


From this analysis we can immediately obtain a state space equation for the 
differential equation 


(p +1}(p +2)x= 5u p-—-- 
or 
X + 4% + 5x + 2x = Su 


The simplified form of the state space equation for this differential equation is 
given by Eq. (4-83). The initial condition terms are given by 


»,(0) 3 —5 ST px 
E =|-5 10 0 E] 
y3(0) 5 -15 20) L0 


x = Sy, — 5y: + 5y; 


and x is given by 


System with forcing terms involving derivatives of forcing 
functionu We shall next treat the system equation of the form 
(n) (n=1) 2 (n) (n=1) 
X+ ax tees + ani X + anx = byu + bu +--+ + bnih + bru 
In order to simplify our discussion, we first consider the case where n = 3. 
Then, the differential equation becomes 


š+ a,X + a,.X¥ + ax = bū + bü + baú- bu 
The corresponding transfer function is 
X(s) _ bos? + bis? + bas + bi 
Us)  s-ras--as44 
— bos? + bis? + bas + b; 
(s — Xs — Ag)(S — Az) 
Here we assume that the poles X,, A», and à; are distinct. (Extension to the 
case of multiple poles is straightforward.) We cannot use Eq. (4-67), since 
the order of the numerator of any of X;(s)/ U(s) (i = 1, 2, ..., n) is equal 
to or will exceed that of the corresponding denominator and, therefore, the 


partial-fraction expansion approach must be modified. 
Consider, as an example, the following differential equation: 


X + 6x + 114+ 6x = ü + 8ü + 170+ 8u (4-84) 


The corresponding transfer function is 
X(s) _ si + 8s? + 175+ 8 (4-85) 
U(s (s + Is + 2s + 3) 
From the analysis in Section 4-2, we know that the simplified state space 
representation of the system can be in the following form: 


Ys) _ 1 

U(s) s+! 

Y,(s)_ 1 (4-86) 
Uls) s+2 

S a E R. 

Uls) s+3 


In order to find Y,(s), Y.(s), and Y;(s), let us divide numerators of the 
right-hand sides of X(s)/ U(s), sX(s)/U(s), and s? X(s)/ U(s) by respective 
denominators and expand remaining terms into partial fractions. 


Kb. x. 1 4 i a 
vay sat “s-2 543 
sX(s) -— me" 
UG) rrr e s+2 s+3 
SEXO oa NUN dud 
Uy Sto ::1*3427343 
Hence 
xo —06) 1, 2 d 
memi sul $42 $45 
sX)—(-2U(.. 1 | 4 . 3 (4-87) 
— tay orl s+? 443 
sX(s) — (st + 2s— OU)_ —1 , 8 ._9 
U(s) $cTl s2'sc3 
If we define 


X,(s) = X(s) — U(s) p 
Xs) = sX (s) — sU (s) — 2U(s) (4-88) 
X,(s) = s*X(s) — UCs) — 2sU(s) + 6U(s) 


Eq. (4-87) becomes 


an [a s tb 

U(s) Sr 

¥(s)|_|-1 -4 -3|| 75 (4-89) 
U(s) re 

xo | 8 €i 


U(s) 2! s+3 


216 STATE SPACE REPRESENTATION OF CONTROL SYSTEMS 


From Eq. (4-86) and Eq. (4-89), we obtain 


Xi(s) —1 2 1] [ ¥,(s) 
X(s)| = 1 —4. —3 || Y,(s) (4-90) 
X;(s) =] 8 9 | | Y3(s) 

or 
Y,(s) —3 —$ —i]|[X» 
Y(s)|—-|—ià —2 —+]| X.(s) (4-91) 
Y,(s) 1 2 +] LX3(s) 


in transforming the differential equation (4-84) into the transfer function, 
Eq. (4-85), we assumed the zero initial conditions. Again assuming the zero 
initial conditions, Eq. (4-88) can be written as 
iL = x— a 
X, =xX—u—2u (4-92) 
X, =X—i— 2ú + 6u 
We hereafter relax zero initial conditions to arbitrary initial conditions. 
By use of Eq. (4-92), the initial condition terms y,(0), y,(0), and y,(0) can be 
written in terms of x(0), x(0), X(0) and u(0), (0), i(0) as follows: 
y, (0) = —4x(0) — $3(0) — 3x(0) + $4(0) + $u(0) + 5u(0) 
y«0) = —ix(0) — 2x(0) — 3x(0) + 4ü(0) + 32(0) + $u(0) 
ys(O) = 43(0) + $x(0) + x(0) — 44(0) — $u(0) — u(0) 
The simplified state space representation of the system given by Eq. (4-84) 
or Eq. (4-85) is 
Jie)» 
Jı = —2y, + u 
J: = —3y + u 
The original variable x can be obtained in terms of y,, y,, and y, as follows: 
x=x +u 
——y + 2y + y+ u 
Since y,, ¥., and y, can be easily obtained once u is specified, we can im- 
mediately find the response x. 
The procedure just illustrated can be extended to the general case where 
X(s)/ U(s) is given by 
X(s) _ bos" + bs! +--+ +5, 
U(s) Ss? ays?! +--+? + ay 
The simplified state space representation of the system where the poles are 
distinct becomes 
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Y _ 1 
U(s)) s—r, 
Y.(s) sted 1 
Y 1 
U(s)  S— Àn 


If the system includes multiple poles of order m, then the state space repre- 
sentation of the system becomes that of Eq. (4-80). 

The procedure for deriving an n X n transformation matrix P, which 
relates y,, Ya ..., Yn and xj, X4, ... , Xn, Where the x, are functions of x, u, 
and their derivatives, is identical to that just demonstrated by use of a simple 
example. 


Effect of cancellation of common terms in transfer function If 
the transfer function is given as 
X(s) __ K(s + a)(s + b) 4-93 
U(s) (s+ a)y(s + b)\(s + c)s +d) ( ) 
then cancellation of the common terms results in 
X(s) _ K 
U(s) (s+ c)(s +d) 
The effect of cancellation of terms in the transfer function lowers the order 
of the system. We used X(s) instead of X(s) in the canceled transfer function, 
since the order of X(s) differs from that of X(s) by two. By the technique 
presented in this section we obtain a state space representation of Eq. (4-94) 
as follows: 


(4-94) 


0) 
U(s) s+c 
X3) — 1 
Ul) s+d 
where Y,(s), ¥,(s) and Á(s) are related by 
" —x K 
X(s) B Dem d a= d ¥,(s) 
2  |seK -—dK 
sX(s) é= d = d Y.(s) 


Hence we obtain 
Wn Fe 01r», 1 
lato -alalta e» 


Equation (4-95) is a state space representation of Eq. (4-94), but not that of 


Eq. (4-93). To obtain a state space representation of Eq. (4-93), we must 
restore the order of the system. Let us define a state vector z as follows: 


Zi E 
2 = xs = 72 
Z3 Vi 
Z4 X: 
Then a state space representation of Eq. (4-93) is given by 
LA —a 0 0 © 1 E 0 
2, 0 —b 0 0|lz 0 
a | = le] | td 
23 0 0 —c 01|2; 1 
L2, 0 0 0 —d] Lz 1 


4-5 STATE SPACE EQUATIONS OF SYSTEMS 
DESCRIBED BY DIFFERENCE EQUATIONS 


Dynamic systems in which one or more variables can change only at discrete 
instants of time are called discrete-time systems. The behavior of discrete-time 
systems can be described in terms of difference equations. Note that in solv- 
ing differential equations by use of a digital computer, we reduce differential 
equations to difference equations. By means of introducing discrete-time 
values, the problem of solving differential equations is modified into that of 
solving algebraic equations. 

By use of a set of state variables, the behavior of discrete-time systems 
may be described conveniently in the form of vector matrix difference 
equations. In this section we shall first show that scalar difference equations 
can be written in the form of vector matrix difference equations and then 
show how vector matrix difference equations can be simplified. 

Consider the following scalar difference equation: 


x(k +n) + a,x(k +n — 1) 4- --- + a, x(k) = bulk) (4-96) 
where k denotes the Ath sampling time or kth state in the evolution of the 


process; x(k) is the system output at the kth sampling instant; and u(k) is 
the input at the kth sampling instant. Let us define 


X,(k) = x(k) 
x(k + 1) = x(k) 
xk + 1) = x(k) 


Xn-(k + 1) = x(k) 


Xalk + 1) = x(k + n) = —a,x«(Kk)—asxs-(k) — +--+ — anx,(k) + bulk) 


i 


Then Eq. (4-96) can be written in the following form: 


x(k + 1) 0 I ee 0 0 xs | 
x(k + 1) 0 0 E 0 0 x(k) 
E +1) 0 0 s a oe fe ee 
x,(k + 1) —An —Gn-, c: —@, —a, xe) | 
0^ 
0 
+] - p) 4-97) 
0 
b 


-d 


or 
x(k + 1) = Ax(k) + Bu(k) 
We shall next discuss simplification of vector matrix difference equations. 


The first matrix in the right-hand side of Eq. (4-97) can be simplified by 
means of the following transformation 


x = Py 
or 
x(K)].-[Pun Pe 7 Pin yilk)] 
x,(k) Po. Pze Pon y2(k) 
x,(k) Pai Pus Pand L¥nlk) 


The matrix P is given by Eq. (3-37), if the roots of the following characteristic 
equation 

A + aN"! -p e H an-A H a, =0 
are distinct. For the system with distinct eigenvalues, the simplified state 
space equation is given by 


z(k11 [0 -—- 07 [zk 
ze: Ty [9 Ag ee | p) l 

; =|° > i -+| e 
atnl lo v sla Li 
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or 
z(k + 1) = Dz(k) + Fu(k) 
where z(k) and y(k) are related by y(k) = Vz(k) and V is defined by P~’ B = 
VF. 
if the system has multiple eigenvalues, the state space equation must 
be modified in an obvious way. 
Next, let us consider a state space representation of the following dif- 
ference equation: 
x(k + 3) + a,(k)x(k + 2) + a,(k)x(k + 1) + a(K)x(K) 
= byKk)a(k + 3) + b(k)u(k + 2) + b(k)u(k + 1) + b(Kk)u(k) (4-98) 
We shall show that Eq. (4-98) can be written in the following form: 
x(k + 1) 0 1 0 x(k) c,(k) 
E + » = | 0 0 1 | o + E [u(k)] (4-99) 
x(k + 1) —ax(k) —a,(k) —a,(k)} |x3(k) es(k) 
Clearly, if we define i 
x(k) = x(k) — eo(k)u(k) 
x,(k + 1) = x(k) + ¢,(k)u(k) 
x(k + 1) = x(k) + e )u(k) 
Xj + 1) = —a,(k)x,(k) — a(K)xK) — a,(k) x(k) + e()u(k) 
Eq. (4-98) can be transformed into Eq. (4-99). Then, we obtain 
x(k + 1) — x(k + 1) + ek + 1)u(k + 1) 
= x(k) + e(k)u(k) + e(k + 1)u(k + 1) 
x(k + 2) = x(k + 1) + e(k + 1)u(k + 1) + ek + 2)u(k + 2) 
= X3(K) + e(K)u(K) + ¢,(k + T)u(k + 1) + ck + 2)u(k + 2) 
x(k + 3) = x(k + 1) + ek + 1)u(k + 1) + ei(k + 2)u(k + 2) 
+ co(k + 3)u(k + 3) 
= —ax(k) x(k) — afk) x(k) — a,(k)x3(k) + e(K)u(k) 
+ ok + 1)u(k + 1) + ek + 2)u(k + 2) + ek + 3)u(k + 3) 
Hence we obtain 
x(k + 3) + a (K)x(k + 2) + a(k)x(k + 1) + a5(k) x(k) 
= [—as(k)x,(k) — a,(k) x(k) — a,(k)x3(k) + e(k)u(k) 
+ ek + lulk + 1) + ek + 2)u(k + 2) + ek + 3)u(k + 3) 
+ [a,(k)xx(k) + a,(k)eo(k)u(k) + a(K)ei(k + 1)u(k + 1) 
+ a (K)e«(k + 2)u(k + 2)] + [as(K) xk) + a (X) e (X) (Kc) 
+ aalk)eo(k + 1)u(k + 1)] + [as(k)x,(k) + as(K)es(K)u(K)) 


(4-100) 
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= co(k + 3)u(k + 3) + [e(k + 2) + a,(k)eo(k + 2))u(k + 2) 
+ [ek + 1) + a,(k)e(k + 1) + a Go)e(K + 1)u(k + 1) 
+ [es(k) + a. (k)es() + a,(k)e(k) + as(k)eo(k)] u(x) 


The right-hand side of the preceding equation must be equal to the right-hand 
side of Eq. (4-98). Hence, 


[cok + 3) — bo(k)Ju(k + 3) + [e(k + 2) + a Q)e«(k + 2) — b,(k)u(k + 2) 
+ [e(k + 1) + a(k)e(k + 1) + a,(k)e(k + 1) — b(k)Ju(k + 1) 
+ [es(k) + a. (es) + a (K)e,(K) + as(k)eo(k) — bs(k)Ju(k) = 0 (4-101) 


If we equate the coefficients of u(k + 3), u(k + 2), u(k + 1), and u(K) in 
Eq. (4-101) to zero, respectively, then Eq. (4-98) is always satisfied. Therefore, 


Co(k + 3) = 5,(k) 
ci(k + 2) + a(k) cok + 2) = bi(k) 
e(k + 1) + a,(k) e,(k + 1) + a(k) e(k + 1) = b,(k) 
e3(k) + a,(k) ca(k) + a(k) ci(k) + a(k) co(k) = b.(K) 
it follows that 
co(k) = b,(k -- 3) 
c,(k) = b,(k — 2) — a,(k — 2) b,(k — 3) 
c(k) = b,(k — 1) — a(k — 1) b(k — 3) 
— a(k — 1)[b,(k — 2) — a(k — 2) b(k — 3)] 
c3(k) = b,(k) — a;(k) b(k — 3) 
— [a,(k) — a,(k) a(k — 1)] [b (k — 2) — a,(k — 2) b.(k — 3)] 
— a,(k) [b.(k — 1) — a(k — 1) b,(k — 3)] 


Equations (4-102) determine c,(k), c,(K), c, (k), and c;(k). The initial conditions 
are given by 


(0) = x(0) — co(0) u(0) 
X(0) = x(1) — co(1) u(1) — ¢,(0) uO) 
x40) = x(2) — cQ) u(2) — ¢,(1) u(1) — ¢,(0) u(0) 


We can extend the preceding technique to the nth-order difference equa- 
tion. Consider 


x(k + n) + a(k) x(k +n — 1) +--+ + a(k) x(k) 
= bo(k) u(k + n) + b(K) uk +n — 1) + ++- + br(k) u(k) 
(e= 0, d, 2. so) 


If the initial data x(0), x(1),..., x(n — 1) are known, Eq. (4-103) can be 
given in the following form: 


(4-102) 


(4-103) 


x(k + 1) 0 l e 0 [ x,(k) EXERCISE PROBLEMS 


4 
" | 
x(k + 1) 0 0 ws 8 x(k) | AND SOLUTIONS 
= : E š l | Problem A-4-1 Write the state space equation of the system shown in 
: j . . ; | Fig. 4-5 in which x,, x;, and x, constitute the state vector. Simplify it to the form 
Xn-1(k + 1) 0 0 eae 1 Xn-1(k) | where the coefficient matrix of a new state vector is in the diagonal form and the 
x(k + 1) —a(k) —a, i(k) +-- -aL x09 elements of the coefficient matrix of the forcing function u are unity. 
c,(k) 2 
c«(K) | s(s+1) 
| 
+) + e | 
| 
€,-,(K) 
Calk) ' 
(k —0,1,2,...) 
The x(k + 1) (i= 1, 2,..., n) are defined by > ! Figure 4-5 Control system. 
x(k) = x,(k) + co(k) u(k) | Solution The state space equation of the system is obtained by 


| writing the input output relationship for each transfer function and then taking 


x(k + 1)=-x, c) ulk i i 
(k + 1) xa(k) + e,(K) u(k) the inverse Laplace transform. From the diagram we obtain 


x(k + 1) = "a + Helt Ki ) | sX (s) = X (s) 

P a i (s + 3) X«(s) = 10[U(s) — X,(s)] 
— ikl) ty. n | sire Eon capra ees onn 

Xalk + 1) = —ay(k) xi) — av. (A) x) — + | ionia 


x = X; 
X, = —10x, — 3x, + 10u 
Xs = 2x» t€ 3x; 


—a,(k) Xn(k) + clk) u(k) 


where the c,(k) are determined from 


Cok) = b,(k — n) ; which can be rewritten as 
ek) = b(k — n 4-1) — a(k — n 4- 1) e) & O, 0: ADS 9 
ca(k)= MESA ns: 2 = b i X,|2|-10 —3 O} | x. | +] 10} [4] 
) RI i n+ si a(k — n + 2) ¢,(k) | x; 0 32 -3llx 0 
or 
Calk) = b m — anlk) e(k) — --- — axlk) cn-a(k) — a(k) cn- i(k) ! x = Ax + Bu 
The initial conditions are given by The eigenvalues of A are 
*x,(0) = x(0) — c,(0) u(0) i ù = —5, Am 41 “a P 
x(0) = x(1) — Co(1) u(1) — c,(0) u(0) A transformation matrix P for this system may be obtained as 
X3(0) = x(2) — ¢,(2) u(2) — c s u(1) — c.(0) «(0 i i j ' 
: dd s JB _5, sB 
bi oie uc ee NIE. 
X0) = i — Hea — 1) diio. edis 2) u(n — 2) — 3 
Y 15 I .i5 
— €n-9(1) u(1) — c,.,(0) u(0) i -$ -bOME Ss 
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If we put x — Py, then we obtain 
y = P^APy + P-'Bu 
The inverse of P is found as 


Eg A Lu 
6 12 12 
= 5 .A/15 1 WAS 4 EVAE 
PH) = free Dei | cee Mee 2 ja ie 
my. A lao muting 
3 fAAEIS 214819 I ACE 
12741 24 ^40 24 ^24 
Hence 
Ji —5 0 0 yy E 
os _1_ A5 5 . «5 
^ dic mm 9 dba SR 
1 .A/ 15 
sj | 0 0 TES ja) |-5-/^ 
Put y = Vz where V is defined by 
3 
ar 0 0 
0 B3 0 
5 /15 
a a E NES 
Then, the state space equation of the system can be simplified into 
Ži =5 0 0 Zi 1 
Ż 0 1 3 
à 0 0 -4+ |] z 1 
Clearly, z is related to x by 
x = PVz 


Problem A-4-2 Obtain a state space equation of the following system: 


X + 4% + 5X + 2x = Su 


Solution The characteristic equation has the following roots: 


A, = —], À, = —l, A3 = —2 
The following transformation matrix 
1 0 1 
P=| -1 1 —2 
1 —2 4 
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will transform the given differential equation into the following form: 


yy pl 1 01r» —5 
|= 0 -I1 Oll z |+ 5 | [u] (4-104) 
Vs | 0 0 2] L»: 5 
or y = Jy + P-'Bu. To simplify Eq. (4-104) further, let 
y = Vz 
where 
» Zi 5 =5 0 
y=| ye]; Z =| 2 15 V l0 5 
ys Z3 0 0 5 
Since 
$ 4 0 
Vt=/0 4 0 
00 i 
V-1!P-! Bu and V-!JV are obtained as follows: 
i£ O17-5 0 
V-'P-'Bu=|0 à 0 Sip] | 1 |] 
00 1 5 1 | 
i i 0][-1 1 OTS —S O —1 1 0 
v-ia b ol] 9 — 0//0 5 0j=| 0 —-1 0 
00 i 0 0 —2]10 0 5 0 0 —2 


The given differential equation is then transformed into the following form: 


Z —i 1 01rzi 0 
2y |= 0 —I O0llze |+| 1 | J 
i2, 0 0 —2] Lz 1 


Since x = Py = PVz, the vector x can be transformed into z directly by the 
following transformation matrix: 


1 0 I)[5 —5 0 5. —S 5 
PV-|-1 1 -2//0 5 0|2|—5 10 ani 
i2 ällö 634 $ —i$ 20] 
[Notice that PV is the same as P, given by Eq. (4-82).] 
Problem A-4-3 Obtain a state space equation of the following 
system: 
X -r6x 11x +6=34 +24 +4 


Solution Let us use Method 2 of Section 4-2. The given differential 
equation can be written as 


Xj 0 a d X €1 
X, |= 0 0 1} | x |+] cs | [u] 
F3 -6 -11 -é]ix] Le, 
where ¢,, c;, and c; are determined from Eq. (4—37) as follows: 
C = 3, C. = —16, Cg = 64 


Since the characteristic equation has roots —1i, —2, and —3 the preceding state 
space equation can be written as follows: 


Vi —1 0 Oll» [ 3 177 3 
ys. = 0 —2 0 E a —2 —3 —16 | [u] 
Vs 0 0 —3]Lx, 1 4 9 64 
=] 0 Tn 1 
= 0 -2 O} | » || —9 | [ul 
0 0 —3) Lys 11 
By use of a second transformation 
y = Vz 
where 
1 0 0 
V=|0 -9 0 
0 0 ll 
we obtain 
EA =] 0 Oy fey, 1 
ż |= 0 —2 Ollzs|-cTi 1 | fa] 
Zs 0 0 —3|lz 1 
Vector x and vector z are related by 
X i 1 ITPi 0 OT] fz, 
wje m —2 —3[[0 —9 Ollz 
Xs 1 4 9]10 0 o Z3 


The original variable x is related to x, as follows: 
x-—x 


The initial conditions on z are obtained as 


2,(0) 3 $ £][x(0) 
z.0)|—| 4 i $ x«(0) 
z;(0) Yr cw 2s} Lx(0)] 


and 
x,(0) = x(0) 
x(0) = x(0) — 3u(0) 
x4(0) = x(0) — 3ú(0) + 16u(0) 
Problem A-4-4 Given the following matrix P, 
1 —L 1 
P,-|A, 1-AL A, 
AZ 224.— ML M 
1 0 1 0 -L O 
—|X 1 As}/+}/O0 —AL 0 
AZ 2 M 0 —XAiL 0 
0 -L O 
=P+/0 —ALO 
0 —AiLO 
where L is a constant, show that the inverse of P, is 
LA, LA; LA 
ot pa dud 
: à 0 0 0 
0 0 0 


where A,, A», A; are the cofactors of the elements of the second column of P. 

Solution The determinant of P, and that of P are found as 

|Pi| = Q, — As)’, |P| e» Qa = A4) 
Hence 
|P,|=|P| 
The inverse of P, is 
EX Ag (Vy — Ag) — 2A Ae FAT LOAF—AD+2A, LA, Ae 
Ay Ag (Ay — Ag) A— Ai À;— As 

Ai —2nr,; 1 d 
Notice that the inverse of P can be obtained by letting L = 0 in Pī. Hence, 
LXMQ4—24) LOS — Ai) LO4— a 


1 
cic d duh R 0 0 0 
Pi * Ay iG A3)* 0 0 | 


E 


P = 


Ox v Aa)? 


Notice also that 
^ A; — Xs) = A, 
A, — As = A; 


! 
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Therefore, we obtain | Since 
LA, E. Lf | A, = MAS — iM 
PP = Py BU IRL ET | A: =M — Ai 
ü -0a O | A; = — As 
0 0 0 | we obtain 
Problem A-4-5 Referring to Eq. (4-78), (1) prove that AM + AM + AM = 0 
LA, LA, LA; ó ; | This proves (1) of the problem. 
PI TP] TP] 0|[|0 -L 0 | Noting that , 
0 0 0 a. $. tile - o” | & dau SIN 
0 0 oO ck ch xudis xm n a, + ad, + aM = —M 
where | 4, + añ; + a = —M 
| we obtain 
L = — = constant | 
1 3 | LA, LA LA 0 1 0 1 0 1 
a, = —(2r, + X) Py Tel TE1 
à, = Me +} | 0 0 0 0 0 1 $e d 9 
a, = =M "REN 0 0 0 —a, —a, —a LM 24 M 
1 0 1 | 1 0 17-2 0 1 0 Q0 EL Q 
Po 1 » dX, d 4 0 0 1 XE WE 
Ai 24 M : Aj 24 M —a, —a, —a,ji0 —ML 0 
and A,, A>, A; are the cofactors of the elements of the second column of P. | Ad + AM + AA A + 2AM + 3AM Airs + A5 + AAS 
(2) Prove also that = TE 0 0 0 
LA, Là E | 0 0 
TPT TPT n 0 1 0/1 O0 1 | " 
à 8 d = Bis Dy —Jj ü NE S 
E A e S pM t Aa ! + apt Dakot A-3X[ XQ—X]]0 up 0 
— T u— 2 
0 4, —à,; —a | LM 2M M | Ai —2r 1 0 —XL O 
1 0 ITT 0 1 0 0 -L O 
0 Art 2A04 3A 0 0 eM hat By. 6 
+] A, 1 Xs 0 0 1 0 —XL 0|-0 . ¥ ‘ 1 " 3^1 ula L : 1 ~ i^a ol 
2 2 = Tr TST 
At 2A, M —a; ~ü; —dü, 0 —XiL 0 i |P| 0 0 0 |P| 0 0 0| 
Solution Noting that ERES V i ! 
^ N d 43 + a^, + a^i Ai, we obtain 0 A E2A94-F3APM — M — MM -ERIADS 0 
EN, i m 
4 3 0 Ü . 02 £116 x 0 x ý 
0 0 0 =ü; — dio —X o9 Notice that 
=L? A, + 2AA + 3AA — A} — AAS T 2AA 
0 AA, +A A2 X i 1 E D 3 1 3 
- TPT er PD B i = Na — MAF 204 — MY + 304 —4)M — M — «XM + DIDS 
0 0 0 =0 
" 0 0 | This proves (2) of the problem. 


Problem A-4-6 Given the following state space equation: 


X 0 L Oy Ee 1 
Xe: = |0 o afal a foa 
Xs l c3 8x; 1 
obtain the corresponding scalar differential equation in terms of Xj. 
Solution The characteristic equation is 
A — 32+ 3X1 —1=0 
The desired scalar differential equation can then be written as 
X — 32, + 3%, — x, = boll + b + batt + byu 
The constants 5,, bı, b», and b, are found from Eq. (4-46) as follows: 
b, = 0; b, = 1, b, = —2, b= 
Hence we obtain 


*; — 34, 434 — x = ü — 2ú +u 


or 
(p—1*x-(p—-lyjy pat 
dt 
Problem A-4-7 Consider a nonsin i isti i 
gular matrix P con - 
| ly independent column vectors P}, P5, ..., P, Or E d 


| P = [Pi P, IP, 

For an n x n matrix A if AP; can be expressed as 

| AP, = buP, + b4P, +- + bnPpn 

| where the b; (i,j = 1, 2,...,n) are constants, then show that 
| by ba e bin 


b, b, ae ban 
Solution Notice that AP; is the ith column of AP. Hence 


z n 
X5 =e D bap 
PES j=l 


AP = [AP, i AP i ---!AP,] = È bap; aceae È brpa 


n n 
D b5apa ce 2 DinPnj 
D 


jz1 
Pu Pr Pin | [Ow bg b, 
E Daz Pon || be, bas Ds 
E E x = PB 
Pm Pas Dn ba bas Dig 


Since P is nonsingular, we obtain 


PAP =B 
_ From this we see that if we denote by Q; the ith row vector of P~', then 
QAP; = bij 


Problem A-4-8 Suppose that A is an n X n nonsingular matrix and 
B is an n x 1 matrix. Suppose also that B, AB, AĉB, ..., A"-!B are linearly in- 
dependent. Define an n x n nonsingular matrix P such that its columns consist 
OE DP, P... so, ens OF 
P = [Pi Pai i P] 


where P, =B 
P, = AB 
P, = A*-!B 


Show that it is possible to write P-! AP as follows: 
00 -—. 0 —a, 

] 0 : Q0 —a4 
0 1 ess O —a5 


0 0 o 1 —a 
Solution The matrix P is given by 
P = [P; iB; i --- Pp] = [B AB +-+: | A"7!B] 

First, let us compute AP. - 

AP = A[B! AB; --- | A”-!B] = [AB A°B | --- | A*B] 
Then compute PC. 
0 9 ws © = 
1 
0 1 =e 0 —a, 


— 03-1 


© 
o 


PC = [B! AB! --- | A?-1B] 


0 0 + 1 —a 
= [ABİ A?Bi ---! A"ÀB; —a,B — a,-,AB — --- — a, A"^!B] 
Since the column vectors B, AB, ... , A?-!B are linearly independent, all other 
n vectors can be formed from a linear combination of these independent vectors. 
If we pick a}, a5, ..., a, such that 
A"B = —a,B — a, ,AB — --- — a,À?-!B 


then 
AP — PC 
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Since P is nonsingular, we obtain 


- o 
oo 
oo 
E d 
A A 
3 3 
a 

| 


P-1AP = C = 


© 
b 
.. o 
| 
n 
3 
1 
2 


00 ~. 1 —a, 
Problem A-4-9 Consider the system 


X = Ax + Bu (4-105) 


Show that if A has rank n and column vectors B, AB, A?B, ... , A”—!B are linearly 
independent, then by means of a suitable transformation, Eq. (4-105) can be trans- 
formed into the following form: 


y=Cy+ Fu (4-106) 
where 
00 0 —a, | 1 
E 0 —a,-; 0 
c=]? 1 0 —aa| pul 0 
0 0 +++ 1 —a 0 


Here, @;, d,...,@, are constants determined by A and B. (The elements of A are 
4;; which are different from a}, as, ..., a4.) 


Show also that by eliminating y;, ys, . . . , ¥n-; from Eq. (4-106), we can obtain 
the following scalar differential equation: 


(n) (n-1) (n-2) 1 
Ja aya rasa boos + Qn-1 Yn T ann SU 


Solution Define the matrix H to consist of n linearly independent 
column vectors B, AB, A?B, ... , A^-1B, or 


H = [BI ABI A?Bi --- | A?-1B] 


Then H is nonsingular and H-! exists. Let us put x — Hy. Then Eq. (4-105) 
becomes 


y = H^'AHy + H-!Bu 
Observe that 


1 1 
0 0 

H| : |=(BiAB}---;A""B]| - |=B 
0 0 
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Thus 


Computing AH we obtain 
AH = A[B} AB i --- i A"?! B] 


= [AB! A?B! - - - | A"B] (4-107) 
Next compute HC. 
0 0 0 —a, 
1 © 0 —a4 
0 —a,- 
Hiit + ies 
0 0 1 -a j| 
ro 0 0 —a, 
1 0 Ü. 25-5 
= [Bi ABi-:- | A"7!B] 0 1 " m 
00- 1 —a 
= [AB! A?B| --- iA" Bi —a,B — a4-,AB — -+ - — a, A”-1B] (4-108) 


Comparing the right-hand side of Eq. (4-107) and the right-hand side of Eq. (4- 
108), it is obvious that the first n — 1 corresponding columns are equal. If the 
a, are chosen such that 

A"B = —a,B — a,-,AB — +++ — a, A^ B 


which is equivalent to 


— n — an 
— ân- —3-; 
A?B = [B! AB} --- i A”-1B] =H 
—a, —a, 
or 
—ü, 
—an-1 


H-'A™B = (4-109) 


—a, 


then the right-hand side of Eq. (4-107) and that of Eq. (4-108) are equal. This 
means that if we determine the a, according to Eq. (4-109), then Eq. (4-105) can 
be transformed into Eq. (4-106). 
Since Eq. (4-106) is equivalent to 
dy = —Anyn bu 


Ys = yi — An- Yn 


Jn = Ya-i — A; Yn 
» Yn-ı We obtain 


(n) (n=1) (n-2) 
Yn +AYn + AiYa dod Qn-1 Yn F days = U 


Problem A-4-10 Show that, by a suitable transformation, the system 


by eliminating y,, ys, ... 


X 0 1 O1 (x, —1 
A&|-| 3 0 2||x|-| 2]|[] 
X; —-D —7 —6]lx 3 
can be transformed into 
Ži 0 1 Od Tz 0 
2, [=] © 0 1} | z |+] 0 | [J 
25 —6 —11 —6] Lz 1 
Solution Let us perform the transformation 
x = Hy 
where H = [B| AB} A?B] 
and 
0 1 0 - 
A= 3 0 2| B=| 2 
—12 —7 -6 3 
After a simple computation, we obtain 
-1 2 3 
H=| 2 3 —34 
3 —20 75 
and 
—455 —210 —77 
H^ = g| —252 —84 —28 
—49 -14 -—7 
Since 


—34 
AB = | 159 
—248 


we obtain by use of Eq. (4-60) 


— 755 (745573) + (—210)(159) + (—77)(—248)] = —6 


inm a l(—252)(—34) + (—84)(159) + (—28)(—248)] = —11 


—a, = Sh (749-34) + (—14)(159) + (—7)(—248)] = —6 


Hence 
y = H-!'AHy + H^!Bu 
becomes 
Vn 0 0 —6 ^» 1 
»|-|10 -u]||» |+] 0 |t 
Vs 01 —6) Lys 0 


By eliminating y, and y, from the foregoing equation, we obtain 
js + 69s + 1194 + 6y; =u 


Let us define 
Zi Js 
Z=| 2 |=! Ys 
Z3 ds; 
Then, we obtain the desired result: 
Zi 0 1 07) fz 0 
s = 0 0 1}|z.}/+] 0 | {uJ 
Cs al -06. = 6 Z3 1 


Problem A-4-11 Obtain a state space equation of the following 
transfer function 
X(s).. K(s — a)(s —b) 
U(s) (Ss —Ay)(S —23)(s — Az) 


where the poles X, Aa, and A; are distinct. 


Solution 

X(s)_ Dui Pio Pis 

Do tAn tg F 

sX(s) _ Pos Pos Des 

U(s) Edi D "EE pem ek 

i: p P32 Pss 
ii i ea 

By defining 
As) = X(s) 


X,(s) = sX(s) — KU(s) 
X;(s) = s?X(s) — KsU(s) — KQ4 + Xs + As — a — b)U(s) 
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we obtain 


E Du Pi Pis 5 z - 
US =| Poa Pu Pss $ E Ne 
E Ps: Ps Ps = 
The simplest state space representation of the system is 
Yi(s) 1 
U(s) $—M 
Ys) | | 1 
U(s) S— Ay, 
Y,(s) 1 
U(s) S — À; 
or 
YW = MY +u 
Jo = My. + u 
Js = Mays + ue 
A transformation matrix P is given by 
x Pu Pie Pis| | M1 
Xa | 5| Por Poe Pozs|| Ye 
Xs 31 Ps P3 Js 


and x is obtained as 


X = X, = Duy: + Piy: + Diss 
The initial condition terms are given by 


y,(0) Pu Pi Pi | [ x,(0) 
»«(0) |=| Por P22 Pos x«(0) 
y3(0) P31 P32 Pa x;(0) 
where 
x,(0) x(0) 
x,(0) | = x(0) — Ku(0) 
x;(0) %(0) — Ku(0) — K(i + M + às — a — b)u(0) 
(Notice that, although the transfer function assumes the zero initial conditions, 


once the state space equation is obtained we can relax the zero initial conditioris 
to arbitrary initial conditions.) 


Problem A-4-12 Obtain a state space equation for the following 
transfer function: 
X(s) _ s+2 
U(s) (s+ 1)?(s +3) 
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Solution in the partial-fraction form, the given transfer function 


becomes 
Xy . 1/2 1/A , —1/4 
U() (tif scl sc3 


Multiply X(s)/ U(s) by s and expand it into the partial-fraction form 
sX(s |  s(s--2) —1/2 1/4 3/4 


U(s) “G+D%s+3) G+)? sel SFI 
Multiply X(s)/U(s) by s? and divide the resulting fraction 


St(S). SSD) — o... 3$ SES 
Us) (s-cDWs-c3) G+ 1s + 3) 


Then rearrange 
s? X(s) — U(s) | —3s? — 75 —3 


U(s) —(-c1*( 4 3) 
1/2 —3/4 , —9/4 
=a FI "sei 343 
By defining 
X(s) = X(s) 
Xs) = sX(s) 
X,(s) = s?X(s) — U(s) 
we obtain 
TX) i d LTD d 
U(s) 2 4 4 || (66 +17 
xo .1 i 3)) 1 
U(s) | 2 4 4 SHl 
xo||1.3.9||t. 
U(s) 2 4 4 $43 
By the following transformation from x to y 
x Eo x “Ei 
X) |=| —4 i i J2 
Xs i —% —2 Lys 
we obtain 
Y(s). 1 
U(s) (s +1) 
¥(s)_ 1 
U(s) s+1 
Ys) | 1 
U(s) s+3 
or in the state space representation 
j= -y th 
J: = —ys +u 
Ys = —3y + u 


The initial condition terms are 


»,(0) 4 $ -—Ip'px(Q) 
70) =] -5 i t1 0) 
¥3(0) 2 =F SE Lag) 
x,(0) = x(0) 
*,(0) = x(0) 


x;(0) = x(0) — u(0) 


Problem A-4-13 Obtain a state space representation of the following 
difference equation: 


x(k + 3) + 5x(k + 2) + 7x(k + 1) + 3x(k) = 0 
Solution Let us put 
x(k) = x(k) 
x(k) = x(k + 1) 
x(k) = x(k + 1) 


where 


then, we obtain 


x(k + 1) = Ax(k) 


where 
0 1 0 
A=! 0 0 1 
—3 —7 -5 


Problem A-4-14 Rewrite the following scalar difference equation 
x(k + 2) + 3x(k + 1) + 2x(k) = Su(k 4- 1) + 3u(k) 
in the form of a state space equation. 
Solution The equation 
x(k + 2) + a(k)x(k + 1) + a(k)x(k) 
= by((K)u(k + 1) + b(k)u(k) 
can be written as 
x(k + 1) = x(k) + b(k — Du(k) 
xXa(k + 1) = —ax(k) x(k) — a,(k) xo(k) 
+ [6.(k) —a,(k)b,(k — 1)] u(k) 


Foa CA 
nM CU NNI 


Thus, 


In the present equation, 
a,(k) = 3, 
b(k) = b(k — 1) = 5, 


a(k) =2 
bk) = 3 
Thus, we obtain 


BE AESi 


Problem A-4-15 Obtain a state space equation for the following 
difference equation: 
x(k + 3) + 5x(k +2) + 7x(k + 1) + 3x(k) = u(k + 1) + 2u(k) 
Solution Let us define 
x(k) = x(k) — egu(k) 

x(k + 1) = x(k) + eu(K) 

Xa(k + 1) = x(k) + cou(k) 

x(k + 1) = —3x,(k) — Tx«(Kk) — 5x(K) + c ulk) 


Then we obtain 


x(k + 1) = Ax(k) + Cu(k) (4-110) 
where 
x(k +1) 0 1 0 i, 
x(k + 1) =| x.(k +1) }, zl 0 0 ls C= e | 
x(k +1) =g a c;(k) 
From Eq. (4-102) we find 
€, —0 
€i = 0 
C= 1 
C. = —3 


The roots of the characteristic equation |A — AI| = 0 are found to be —1, —1, —3. 
In order to simplify Eq. (4-110) let 

x(k) = Py(k) 
where 


Then Eq. (4-110) becomes 
y(k + 1) = P! APy(k) + P~! Cu(k) 
where 
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and 
j=) Bros i 
P-C=|} 2 1 1|2| 3 
P P Hie) oL 


By the use of the following transformation 


y(k) = Vz(k) 
where 
$i 9 
V=|0 £i 0 
00 —; 
we obtain 
—1 1 0 
V-P-1 APV = 0 —I 0 
0 0 —3 
and 
2 —1 0 i 9] 
V'P-1C —| 0 2 0 ¿|=| 1 
0 0 —4j| — 1 | 
The simplified state space equation is given by 
zj(k + 1) —1 1 07 [z,(k) 0 
Z({k+1)|= 0 —I1 O|] zo(k)} +] 1 | [u(&)] 
z3(k + 1) 0 0 —3_] Lz;(k) 1 


where the initial data z(0) are given by 
z(0) = V-!y(0) = V-!P-1x(0) 
and 
x,(0) x(0) 
x(0) = | x,(0) | = x(1) 
x,(0) x(2) — «(0) 


In order to obtain z(0), we must know x(0), x(1), x(2), and w(0). 
Notice that in this problem 


x= PVz 
where 
[ i Pest 
PV =| —} i i 
"TU 


will directly transform Eq. (4-110) into the simplified form. The transformation 
matrix PV is the same as the transformation matrix obtained in Prob. A-4-12 by 
the partial-fraction expansion technique for a similar system 
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A) V s+2 
U(s) (s +1P(s + 3) 
or 
(s? + 5s? + 7s + 3) X(s) = (s + 2) U(s) 
Notice also that in Prob. A-4-12, x,(t), x«(t), and x,(1) are defined by 
x(t) = x(t) 
X(t) = X(t) 
x(t) = X(t) — u(t) 
whereas in this problem, x(k), x(k), and x4(&) are defined by 
x(k) = x(k) 
x(k) = x(k + 1) = x(k + 1) 
x(k) = x(k + 1) — cou(k) = x(k + 2) — u(k) 


PROBLEMS 
Problem B-4-] Obtain state space equations of the following systems: 
(1) X+5X4+7%4+3x=u4+2u 
(2) #453 +7% +3x = ü -+ 34+ 2u 
Problem B-4-2 Given the following state space equation 
X, 0 1 OF [ x; 0 
X,|2|-2 —3 O}} x J+) 1) [u] O 
X —1 1 —3jJ Lx 2 
Obtain the corresponding scalar differential equation of the form: 
Jı +a Y, + any, + ayı = b ü + brú + bzu 
Examine if it is possible to transform the original equation into the form 
Js + aÏ; + aý; + azy; = u 


Problem B-4-3 Suppose that A is an » x n nonsingular matrix and 
Bis an n x 1 matrix. Suppose also that the column vectors B, AB,..., A"*!B are 
linearly independent. Define an n x n nonsingular matrix P such that 


P = [A"!Bi A*7?B! - - -| ABiB] 


Show that 
-— i 0 0 
—ds 0 1! 0 0 
P-1AP = Dos 
—a,-, 0 0 0 1 
— ån 0 0 0 0 


where à;, à», ... , a, are constants determined by A and B. 


Problem B-4-4 Obtain a state space equation for the following trans- 
fer function: 


X(s) _ (s + 2) 
Us) (s$--s-c-D(crl 
Problem B-4-5 Transform the following differential equations into 
state space equations by use of the partial-fraction expansion technique. 
0) #4+3%4+3x+x=u 
(2) Ë +3 +3% +x=ůú+u 


n» Problem B-4-6 Derive a state space equation of the system shown 
in Fig. 4-6. By what transformation can we transform the state space equation into 


Figure 4-6 Control system. 


the simplified form where the coefficient matrix of the state vector is in the diagonal 
form and the elements of the coefficient matrix of u are unity ? 


Problem B-4-7 Transform the following difference equation: 
x(k + 2) + 3x(k + 1) + 2x(k) = 2u(k + 1) + 3u(k) 


into state space equations where the coefficient matrix B of the forcing function 
u is given by 
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Matrix Analysis 


5-1 INTRODUCTION 


In Chapter 2 we presented preliminary materials on matrix analysis. In this 
chapter, we present additional materials on matrix analysis which are needed 
in the rest of this book. Specifically, we shall discuss functions of a square 
matrix, quadratic forms, and Hermitian forms. 

In the succeeding sections of this chapter, we proceed as follows: Section 
5-2 presents matrix polynomials, Frobenius theorem, Cayley-Hamilton 
theorem, the minimal polynomial, and the exponential function of a matrix. 
Section 5-3 gives Sylvester’s interpolation formula. In Section 5-4 we discuss 
real quadratic forms and Hermitian forms and shall present theorems on 
real symmetric matrices and Hermitian matrices. We shall also, discuss 
definiteness, semidefiniteness, and indefiniteness of real quadratic forms and 
Hermitian forms. Finally, Section 5-5 presents Sylvester’s criteria for defi- 
niteness and semidefiniteness of real quadratic forms and Hermitian forms. 


5-2 FUNCTIONS OF SQUARE MATRIX 


In this section we shall discuss polynomials in a square matrix, Frobenius 
theorem, Cayley-Hamilton theorem, the minimal polynomial, and the expo- 
nential function of a matrix. 


Polynomials in square matrix Let A,, Ao,..., A, and B,, B., 
..., B, ben x n matrices. Define f(x), g(x), and A(x), where x is a scalar, 
as follows: 

243 
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EL i 
f(x) 2 Ax 


a(x) = X Bw 


j=0 
A(x) = f(x)e(x) = >, I A,B,xt*i 


We must be careful in substituting an n X n matrix C for x in the preceding 
equation for A(x), since the multiplication of matrices is not commutative 
in general. We have the following relationships: 

p q 

Y X A,B, Cit 


j-0 


i= 


o 


q q q 
= A(X B,C!) + A (È BC')C + --- + A (X BC) 
j=0 j=0 j=0 
= PA A.C!) (X B,C’) if B and C commute 
z x A,B; CH s (= AC) (Z B,C’) 


Notice, however, that 


if B and C do not commute 


X X ABC =0 if 


i=0 j= 


> BC —0 
j=0 
Also 

p g 

23,2; C AB, 


i=0 j=0 


-(£e4)(& 0s) 


if A and C commute 


if A and C do not commute 


EECUAB(LcA)(àom) 
Notice that 


*YXCHUAB0 if 


i=0 j=0 


We shall use these results occasionally. 


Y CA, =0 
i=0 


Frobenius theorem We shall next present Frobenius theorem 
concerning the eigenvalues of a polynomial in an n x n matrix A. 


Theorem 5-1 (Frobenius theorem) 
values of an n x n matrix A are Ai, A», . 
polvnomial in x, or 


Suppose that the eigen- 
-<-s An. Let p(x) be an arbitrary 
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p(x) = a,x? + a,x” + e a, 
Then the eigenvalues of the matrix polynomial p(A) where 
P(A) = aA? + a, A?! + --- + ayI 


are pi), PA), - - -> PAn). 


Proof Let the roots of p(x) — X = 0 be jn, Hs, ...; Hp- Then 


po) — A= ax — i) = pa) “+ = Me) = ao JẸ Œ — n9 


Noting that 
ahi + H: cc: + Hp) = —a 
Qy (My Ha + Kiks + cto + Hp-1Hp) = a, 


` doliis +++ Up = (—)’ (a — X) 
we have 
P(A) — AL = aA? + a, A?^! + --- + (ay — A)I 
= a,(À — mI) (A — pI) --- (A — u,I) 
Taking the determinant of p(A) — AI 


|p(A) — AT] = a£ |A — i4 1]| A — uI | -+ A — us | 
Since 
JA — M| = (A, —2)04 —2) +++ An — A) 
we obtain 
IA) — 1| = ar TT s — n2] [HT Gs — no --- [YT 05 — 22] 
= f ay iI (A; — uu) 
n í p 
= J] [a O- 29] 
= ITuo02 - x 
= [PA — M [PAs) — X) «++ (205) — X] 
This proves that the eigenvalues of p(A) are p(A;), P(A»), . . . , PAn). 


Cayley-Hamilton theorem The Cayley-Hamilton theorem 
plays an important role in simplifying polynomials in ann X n matrix A. It 
states that A satisfies its characteristic equation. 


We shall first give an illustrative example and then present the theorem. 


Consider the following matrix A: 


1 2 0 
A=|3 —1 —2 
1 0 —3 


The characteristic equation is 
IA — AIl = —33 — 3 + 74-17 — 0 


Notice that (A — X I) and adj (A — X I) are polynomials in X of degree one 
and degree two, respectively. Namely, 


Low 1 2 0 
A—Al=—|0 1 01X24 [3 —1 —23 
0. 0 1 1 0 —3 
and 
A? -F4X 4-3 2X +6 —4 
adj (A — ATI) = 3X -- 7 M+2%—3 —2r~+2 
r+ 1 2 A2—7 
I oOo 4 2 0 3 6 —4 
= 1 93-153 hj —x 7 
0 0 1 I 0 6 i'"x- 2 


Observe that adj (A — AI) is a polynomial in X of degree n — 1, where n — 3 
in this example. If we multiply (A — AD) by adj (A — AE) we obtain 


(A — XI) adj (A — AI) 


100 -3 0 v 

-—|0 1 ola +| 0 —3 olx: 
001 Ü @ =3 
700 17 0 0 

+10 7 Ola+] 0 17 0 

Uu 0 7 0. 0. I7 

= — XI — 301 + 7AE + 171 

—(—X! — 37+ 7A. + 17)I (5-1) 


Observe that if we substitute A for X in Eq. (5-1), then 
— A — 3A* + 7A + 171 


3 14 12 7.0 —4 
——|27 —11 —38|—3|—2 7 8 
13. —6 —3l =—2 8 


L 2 0 Io 0 
+7|3 —1 —2|/+17/0 1 0 
1 0 —3 00 1 
0 0 0 
=|0 0 0 
0 0: 0 


Thus, if A is substituted for X in the characteristic polynomial multiplied by 
the identity matrix, then the result is zero. This is an illustrative example of 
the following theorem: 


Theorem 5-2 (Cayley-Hamilton theorem) Let A bean n x n 


matrix and its characteristic equation be 
|A — AT] = (=) Qv + eX7! + --- + aed? + Cn: + Cn) —0 
The matrix A satisfies its characteristic equation, or 
A” + c A! + +--+ + Cnag A? + Cn- A+ Cpl = 0 


Proof To prove the theorem we first note that adj (A — AI) 
is a polynomial in X of degree n — 1, namely, 
adj (A — AI) = B,X7^! + BA"? + --- + B, 4A? + B, 4X + B, 
where B, = (—r-'I 
Note also that 
(A — AD adj (A — AD) = [adj (A — AD] (A — AD = [A — Ai |i 
Hence we obtain 
(A — AMLI = (—) AN" + c, Fat! + --- 47 6 4I? + c4 4 I + nD) 
= (—Ix + AXB, X” + BA”? + --- + B4.,,A* + Brna A + B,) 
= (Bii! + Bvt? + --- + B, + B,_,A + B,\(—DT + A) 
From this equation, we see that A and B, (i = 0, 1,2, ..., n — 1) commute. 
Hence the product of (A — AI) and adj (A — AI) becomes zero if either one 
of these is zero. If A is substituted for X in this last equation, then clearly 
A — XI becomes zero. Hence, 


A" -- c, A"! + --- + Cr- A? + c4 ,A + CCnl = 0 
This proves the theorem. 
Minimal polynomial As stated in Theorem 5-2, every n x n 


matrix A satisfies its own characteristic equation. The characteristic equation 
is not, however, necessarily the scalar equation of least degree that À satisfies. 


248 MATRIX ANALYSIS 


The least degree polynomial having A as a root is called the minimal poly- 
nomial; namely, the minimal polynomial of an n X n matrix A is defined as 
the polynomial (A) of least degree 


f(A) — A" + a,A"7! + +++ + an, m<n 
such that ¢(A) = 0, or 
(A) = A" + a,A™'+ .---+a,1=0 


The minimal polynomial plays an important role in the computation of 
polynomials in an n x n matrix. 


Theorem 5-3 Let d(d), a polynomial in A, be the greatest 
common divisor of all the (n — 1)-rowed minors of an n x n matrix 
(A — AT). If the coefficient of the highest degree term in X of d(A) is chosen 
as (— 1)", then the minimal polynomial $(A) is given by 


214 — AI 
W= s 


Proof By assumption, the greatest common divisor of the 
matrix adj (A — AJ) is d(A). Therefore 


adj (A — AD) = d(X)B(X) 


where the greatest common divisor of the z? elements (which are functions 
of X) of B(A) is unity. Since 


(A — AD adj (A — AD = [A — AI |I 
we obtain 
d(X)(ÀA — ADBA) = |A = ALI 
from which we find that | A — AI | is divisible by d(A). Let us put 
|A — M| = d OQ) 


Then the coefficient of the highest degree term in X of 4/(X) is unity. Since 
we have 


(A — ADBA) = WAI 
it follows that 
(A) = 0 
Note that (A) can be written as 
WA) = APA) + a) 


where a(A) is of lower degree than (A). If 4/(A) = 0, then a(A) = 0. Since 
@(X) is the minimal polynomial, a(A) must be equal to zero, or 


ala N — af ANS 


o ——— —— ER wes - B 


MATRIX ANALYSIS 249 


Note that because (A) = 0, we can write 
pA = (A — ANCA) 
Hence 
YA) = APANI = g(rA)(A — ACA) 
and we obtain 
BA) = g(r)C(r) 
Note that the greatest common divisor of n? elements of B(A) is unity. Hence, 


go) = 1 
Therefore, 
Wr) = dQ) 
it follows that ' 
. |A — M| 


This proves the theorem. 

It can be shown that similar matrices have the same minimal polynomial. 
Namely, for a nonsingular matrix T, the minimal polynomial of T-'AT and 
that of A are the same. (For the proof, see Prob. A-5-3.) 

The minimal polynomial $(A) of an n x n matrix A can be determined 
by the following procedure: 


1. Form adj (A — AI) and write the elements of adj (A — AI) as factored 
polynomials in X. 
2. Determine d(A) as the greatest common divisor of all the elements of 
adj (A — AD. Choose the coefficient of the highest degree term in X 
of d(X) be (— 1)". If there is no common divisor, d(A) = (— 1)". 
3. The minimal polynomial $(X) is then given as | A — AI| divided by 
d(X). 
Whenever A has distinct eigenvalues, apart from the constant factor 
(—Y the minimal polynomial is identical with the characteristic polynomial. 
Note that if the multiple eigenvalues of A are linked in a Jordan chain, 
the minimal polynomial and the characteristic polynomial are identical. As 
an example, consider 


2 1 4 
A-—|0 2 M 
i 3, 1 
The characteristic polynomial is given by 


2—2X 1 4 
0 2— 0 


n a T ls 


[A — M| = = (2 — AP (1—)) 


Thus the eigenvalues of A are 2, 2, and 1. It can be shown that the Jordan 
canonical form of A is 


and the multiple eigenvalues are linked in the Jordan chain as shown. In the 
following we shall illustrate that the minimal polynomial and the character- 
istic polynomial of this matrix A are the same. To compute the minimal 
polynomial let us obtain adj (A — AL). It is given by 


(2 — XX1 — X) (X + 11) —4(2 — X) 
adj (A — AD = 0 (2 — AXI — X) 0 
0 —3(2—X) (2— Xy 
Notice that there is no common divisor of all the elements of adj (A — XI). 
Hence d(x) = (— 1)* = — 1. Apart from the minus sign $(X) is identical 
with the characteristic polynomial, or 
fA) = —|A—AT] = —(2—»*(1 — X) 


=A =N Sk 4 
A simple calculation proves that 
A’ — 5A* + 8A — 41 = 0 
but 
A? — 3A + 2140 
Note that if the multiple eigenvalues of A are not linked in a Jordan 


chain, the minimal polynomial is of lower degree than the characteristic 
polynomial. Consider the following example: 


2 0) 0 
A= 0 2 0 
03 1 
The characteristic polynomial is given by 
2—X 0 0 
|A—AI| =| 0 2—X 0 |2Q2—2AXy(1—2) 
0 3 1—2AX 


A simple computation reveals that A has three eigenvectors, and the Jordan 
canonical form of A is given by 


2.0, 0 
020 
Q 9g 1 


Thus, the multiple eigenvalues are not linked. To obtain the minimal poly- 
nomial we first compute adj (A — AD), or 


(2 — XX1 — X) 0 0 
adj (A — AD = 0 (2 — AX1 — X) 0 
0 —32—A) (2—A) 
from which it is evident that 
da) =2—xX 
Hence 
40) = lg eN 
=) — 3A +2 
As a check, let us compute $(A). 
(A) =A? — 3A + 2I 
400 2: 0; 0! L 0 01 
—|04 0|—3|02 01 +20 1 0 
© 9:1 03 4 10 O 1j 
00 0 
=|0 0 0 
00 0 


For the given A the degree of the minimal polynomial is one lower than 
that of the characteristic polynomial. 


Inverse of A Notice that, by the use of the minimal poly- 
nomial, the inverse of a nonsingular matrix A can be expressed as a poly- 
nomial in A with scalar coefficients. Consider a nonsingular matrix A and its 
minimal polynomial $(AÀ) where 


(A) = A" + a, A"! + +++ + a, E— 0 
For a nonsingular matrix, a5 Æ 0. Hence, 
i= —L(A A arat e se aus A) 
Premultiplying with A-' we obtain 
A = -4am 4p a, Am? s Haaa) (5-2) 


Thus, A-! can be numerically computed according to Eq. (5-2). 
We shall next present a theorem on infinite series in an n x n matrix A. 
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Theorem 3-4 If the power series 
f(z) = cz 
k=0 
in a complex variable z converges everywhere, then the matrix power series 
Z c, A* 
k=0 


in an 2 X n matrix A converges absolutely. 


Proof We shall first Prove that the infinite series 2 c, À* is 
absolutely convergent if and only if 2 ||c,A*|| is convergent. Define y = (at. 


If Pi c, À* is absolutely convergent, "then there exists a constant M such that 


E celal X lalla« M 
(NL0sLJj-lI2,—-,) 
where M is independent of i, j, and N. Since 
Xx xX n n 
Dll Atll< X lel za] « nM 
k=0 k=0 i=l j= 
we conclude that X || c, A* || is convergent. 
k=0 
Conversely, if p || c A*|| is convergent, then the series consisting of the 
(i, j)th elements of X c,A* is absolutely convergent, since 
k=0 
| cat; |< |] ce A* || 
Hence we have proved that the series Y; c,A* is absolutely convergent if and 
k=0 


only if > ||.c,A*|| is convergent. 
k=0 


Now by the assumption in the theorem we have 


DilleeA* ll < X Lee (AIDE < M’ 


where M' is a constant. It follows that x || c, A* || is convergent. Hence the 
k=0 
theorem. 


Exponential function of square matrix In the scalar case e 
is defined by 


@=il4t2+ han 


5i WS eL. ouais 


l 
31 D 
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Since this power series 
s 
Zk 
converges everywhere, the matrix power series 
e lux 
— At‘ 
2, ki 


converges absolutely for any square matrix A. The exponential function of 
a matrix can thus be defined for every square matrix A by 


e^ — I4 A d uA! AS ee 


where we used I for A^. Similarly, since cos z and sin z converge everywhere, 
we have 


sin A = A — d; A + a At 

By the definition of the exponential function of A, the exponential func- 

tion of At can be written as 
e^t = I + At + JA +. + arate oti. 

This series converges absolutely for all finite t. (Refer to Prob. A-5-5.) 

Let us next examine the derivative of e^t with respect to t. Differentiation 
of e^ with respect to ¢ yields 
n=] 


S (eh!) = Ap Att eee ART 


n [^ : eee 
=A(I + Art AU EM ) 
= Ae^ 

[n7 
= At. see ee see 
= (I+ Art ++ +r te My JA 
= eMAÀ 


This relationship is identical with the scalar case. Let us examine g^ gs, 


s Au S ane, 
At pAs — 
ails -(E n! X n! 
ES eo n t^g" k ) 
= 2A" (E rim 
a(s + ty" = ghGto 
n=0 n! 


Hence 
eAtedAs — eA(tts) 
In particular, if s = —t, then 
erte-At — g7ALgAt — pAt-t) — 


Therefore, the inverse of e^ is e-^', Since the inverse exists, e^ is nonsingular. 


In general e‘4+®* and e^ e*' are not equal unless A and B commute, or 
AB = BA. 


Let us examine the difference between e‘4+®’! and e^! eBt, We have 


ear I (A+ By + OG Bng AB... 


and 
Mr Meo HA NS 
„(1+ xo ES.) 
— I4 (A+B) +Ê Ape + Be 
Hence 


efA+B)t __ gAtgBt 


_(A+B)(A+B),._ A?+2AB+B* , 
-—— P 


y (A + B(A + B(A + B,; | A' + 3A*B+ 3AB? + B', 
3! 


3! 
MT: 
. BA — AB, 
2 
„BA? + ABA + B'A + BAB — 2A°B — 2AB' ,, 
3! 
+- 


If A and B commute, or AB = BA, we see that 
g^ PU __ e^t gBt —0 

for all t. Hence, 
g(A*BXt. — gAtgBt if A and B commute 


If A and B do not commute, then e^*P"' is not equal to e^ e®*. 


As an illustrative example, let us compute e^' where 


2 X 1 
A=|1 2 1 
EEZ 
Since 
) d 4d 100 
A-|i 1 1140 1 0|/=B+C 
ys] d 0 0 1 
where 
t i 1 100 
Bit al > Lis = |0 I 0 
Lond 0 0 1 


we see that B and C commute, or BC — CB. We have therefore 


g^ — gB*Ot — gBt ett 


e*t is given by 


ect — elt 
2 TI 
sitet e mic T 
=e] 


e® is given by 
Br" 


n! 


ui 242 
"=B ÈE 4+ + FER 


where 
B: =3B, B= 3B? = 3B, 


Hence 


g = 14 1B + LGB) + ROB + --- 
my ont, 
1+ GF COE L... 


ak +5 (et — 1)B 


Thus we have 


e^t = [x T. 4 (et — 1B | (eD 


ea. + (e — e)B 


Bt = 3?B? = 3°B,--- 
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l(e 2e) l("—e) l(e—e) 
=| Aene) Lilet + 2e) $ (et — e) 
liess dpe lagu 


The direct computation of e^ as just demonstrated is practical only for 
simple cases. We shall discuss alternate methods of computation of e^' in 
Section 6-4. 

We shall next present a theorem on the exponential function of A. 


Theorem 5-5 For an n x n matrix A having n distinct 
eigenvectors, e^ can be written as 
e^ = PMP"! (P — n x n nonsingular matrix) 
where 
je 9 
e 
M = 
0 es 
and N, A», ... , A, are the eigenvalues of A which are not necessarily distinct. 


Proof e^ is given as 


Woa 
Since A can be written as 
A = PDP" 
where 
Ai 0 
E 
Dice z 
0 Xn 
we obtain 


A? = PDP^!PDP-' = PDP: 
Therefore, A* can be written as follows: 
A*t = PD‘ P~ 


ap eee em OE PIS pt i tener ee oe 
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where 
AF 0 
X 
D: = 
0 Ak 
Thus, 
= 1 7 0 
% 1 i: 
ete! En" 
anrs 
0 xl 
moe ^ 
e 0 
e 
0 e 
=M 


Hence we obtain 
$ Lp -EÉr(La)e-rP-M 
iain) ali 2 ah Lali: 


This proves the theorem. 


5-3 SYLVESTER’S INTERPOLATION FORMULA 


In this section we shall first present the well-known Lagrange’s interpolation 
formula and then give detailed discussions of Sylvester’s interpolation 
formula. 
Consider the following polynomial of degree m — 1, where i, As, - - -Àm 
are assumed to be distinct: 
(X a. An) Seo (A € X)0 SEE Akai) Mr (X -— Xx) 
pm Fog) e Ca Aea ua) ^7 Du — ad 


(Kk 1:2, m) 


Notice that 
PA) = 1 iff i=k 
=0 221 —. 
Consider next the following polynomial JA) of degree m — 1. 


f09 = È f04)p.0) 
E ) Oi Ka) (A. — Da. DU aa] i 1.3 
= ES Qu — X 


) 0. = Ne) — Meer) >> Oe — Am) 
(5-3) 
The polynomial /(X) takes on values Sx) at points r,. Equation (5-3) is 
the Lagrange’s interpolation formula. The (m — 1)th-degree polynomial 
JA) is determined from m independent data fO), JOa) « 5 s). 

Notice that the (m — l)th-degree polynomial J) passes through m 
points JA), JA ... > Jm). Therefore, f) is uniquely determined. Any 
other representations of the polynomial of degree m — 1 can be reduced 
to the Lagrange polynomial JA). 

We can extend this result to the case of a polynomial f(A) in an n x n 
matrix A. Let f(A) bea polynomial of degree N. Then f, (A) can be divided 
by the minimal polynomial Q(A), yielding 

Jx(A) = g(A)(A) + a. (A) 


where a (A) is the remainder. If the minimal polynomial Q(A) is of degree 
m, then a (A) is a polynomial in A of degree m — 1 or less. Since (A) is the 
minimal polynomial, 


f(A) = 0 
Hence, 
Sx(A) = ay(A) (5-4) 
If fy(A) is a convergent infinite series, which we shall denote f(A), then 


@y(A) converges to a unique polynomial a(A) as N — oo, We shall prove 
this in the following theorem: 


Theorem 5-6 A convergent infinite series f(A) = $ c,A* of 
k 


c=0 
ann X n matrix À can be expressed as a unique polynomial in A of degree 
m — lorless, where m is the degree of the minimal polynomial of A. 


Proof Let us define 


Ju(A) = X c, At 


Since A* can be expressed as 


At = E SAC (E20) 
we can write Á 
Xs (A). = > Gy, At = dyol + Ay, A+ +++ + ay su A"! 
where 
poem e Cees 


Hence 


lim f(A) = lim (dyol + ayi A + +- + Qy,m-1A™) 
N= N-— 


By assumption, the series lim fy(A) converges to f(A). Since the matrices 
N= : ; 
I, A, A*,..., A"-! are linearly independent, if f(A) is expressed as a sum of 
I, A, A*,... , A7-!, the coefficients are uniquely determined. Hence 
lim f(A) = f(A) - 


(lim @yo)I + (lim @y,)A + +++ + (lim Ay.m-\)A™™ 
N =c ya Ne 


CA ETA. + 4-0 ATO! 


where a; = lim ay; (12 0,1. mi L) 
No 


This proves the theorem. À ; 
By the use of this theorem we can write a convergent infinite series f(A) of 
A as a polynomial in A of degree m — 1 or less, where m is the degree of the 
minimal polynomial of A, or 
J(A) = aA) (5-5) 
where @(A) is a polynomial in A of degree m — 1 or less. From Eq. (5-4) and 
Eq. (5-5), we conclude that a polynomial in A of any degree can be expressed 
as a polynomial in A of (m — 1)th-degree or less. 
Since the following analysis applies to both fy(A) and f(A), we shall drop 


the subscript N. T 
We now derive Sylvester's interpolation formula. We first consider the 


case where the minimal polynomial of A does not involve multiple roots. 
Then we shall treat the case where the minimal polynomial of A does involve 
multiple roots. 
Case 1: where the minimal polynomial involves only distinct roots 
Substituting A for X in the polynomial p,(A) we obtain 
— al) --- (A — Age D(A —344D --- (A - A4D 
MULNM TID PUE a TER] 
P(A) is a polynomial in A of (m — 1)th-degree. Notice that 
p.O4D-1 if i=k 
=0 if ixk 
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Hence we can write 


f(A) = DS Q)p.(A) 
a (A — AD = ++ (A — Age (A — Arai D) +++ (A — AnD 
= p WM oie ec: Du) ESR 1 li uo) Gad! A C REN, ot ee? ES lth a 
e OW E) e Dac Mei Os — Xen) *** Qu — Au) 
(5-6) 
Equation (5-6) is known as Sylvester's interpolation formula. Equation (5-6) is 
equivalent to the following equation: 


1 1 sss d I 
Ay Ae =e Àm A 
M Abo M A? 
- (5-7) 
ARI ANS y aigue Am-1 
fO4) FAs) fO) f(A) 


[Solving Eq. (5-7) for f(A) we obtain Eq. (5-6). Refer to Prob. A-5-4.] Equation 
(5-7) can also be written as 


Do M oce XU SA) 
l M M ATUS Sa) 
=0 (5-8) 
1 Am An Am £m) 
I A A? «e+ Amt f(A) 


Notice that solving Eq. (5-8) for f(A) is the same as solving the following set 
of m linear equations for the a; (i = 0,1,2,...,m — 1) 


Ay + yA, + Ayr} -E s Se Qui Vaa —f(Xj) 
Ao + Aio + AN + oes HO ATI! = f (Aa) 


TE s a: $$. (5-9) 
Go +. X + 03 AZ, F re + Amy AM! = f(Am) 

and obtaining f(A) as 
F(A) = a4 + a, À + a, A? + «a, ATH . (5-10) 


Notice also that Eq. (5-10) is equivalent to Eq. (5-4) or Eq. (5-5). Therefore, Eq. 

(5-9) and Eq. (5-10) can be written as 
SA) = ar) 
f(A) = a(A) 


or 
SAN = g(a) P(A) + aA), (A) = minimal polynomial 
A = Xp Apese AmA) 
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Case 2: where the minimal polynomial involves multiple roots We 
shall next consider the case where the minimal polynomial (A) has multiple roots. 
As a simple example, let us assume that $(X) involves three equal roots; that is, 
the roots of f(A) are Ay, Ais X4, Ags Xs. + + + 3 Ame Then we may write 

P(A) =A” + aA! Hes agi + am 
= (A — MPA — Ag)(A — 33) +++ (A — Am) 
Similar to Case 1 we obtain 
fA) = BA) PA) + a0) 
= g (A) [(A — X3. — 34) ++ (A — 24)] + AA) (5-11) 
By substituting X = X, X4, .. , Am into Eq. (5-11), we obtain the following m — 2 
equations: 


fOr) =a) 


FS (Ay) = ary) (5-12) 
fm) = &Am) 
By differentiating Eq. (5-11) with respect to A, we obtain 
d d , 
alm = (A — APA) + T a(r) (5-13) 
where 
A — ADAN) = A Cer — 0. — AQ) +++ OA — Aw] 
Substitution of X == A, into Eq. (5-13) results in 
d dia ia its 
AO) m rre = Fe], (5-14) 


Similarly, differentiating Eq. (5-11) twice with respect to X and substituting X = X, 
we obtain 


Hro m 709 = FRAO, . (5-15) 
Rewriting Eq. (5-12), Eq. (5-14), and Eq. (5-15) we have 
Q + 3a3X, + --- + Mn) AE? =f) 
Qt, + 2a; + 385M + +++ + (m— D) asa AT? =f") 
Gy + Ayr, + GM + oes Ry UNO = fA) "^ 


Ay + 0424 + Qs M peee fla ATA =f) 


do + Nm + 0, Hts H Ga NR =S Am) 


From Eq. (5-16) we can determine the a, (k = 0, 1,2,..., m — 1) in terms of 
fd, S'A), and f"(A,). Then f(A) can be given by 


f(A) = a,I + aA + a5 A? + 8) + a, A! (5-17) 


Equations (5-16) and Eq. (5-17) can, of course, be combined in the following way: 


00 1 5», (m — D(m — 2) «s Pod) 
0 1 24 3 os (m — l)y-? f(r) 
Io» af M o Amd S) 
l AQ HE RE oe Ago fA) | =0 (5-18) 
T XXE XE x AB fO) 
I A A? AS... Am-i S(A) 


If the minimal polynomial has j multiple roots; that is, the roots of (À) are 
Map Mags rns y NEEG Mags s su Ny then f(A) can be given by 


f (A) =a,I+a,A + AA? + «e. F au Am} 


where the à, (k = 0,1,2,... , m — 1) are determined by solving for the coefficients 
a; of the following m equations: 


Li 
dig (8 Fy HOM eb an MED m SOO) 
1 (5-19) 
Zo + yA; + AAG beo Am- MPO = f(A) 
($21,2,..5,J. — l;ij]j41,...,m) 
[Equations (5-19) can be solved for the a, (Kk = 0, 1,2,...,m — 1), since the 
determinant of the coefficients is always different from zero. Notice that in this 
case again the number of the €, to be determined is m, which is the degree of the 
minimal polynomial.] Extensions to other cases where the minimal polynomial 
involves more than two sets of multiple roots are obvious. (The differentiating 
process is repeated for each set of multiple roots.) 
Sylvester's interpolation formula is useful in reducing the exponential function 
of A into a square matrix. We shall present in Section 6-4 a computational tech- 
nique for the exponential function of A based on Sylvester's interpolation formula. 


5-4 QUADRATIC FORMS AND HERMITIAN FORMS 


Quadratic forms appear frequently in problems of stability analysis of 
dynamic systems by the second method of Liapunov and in problems of 
optimization, that is, maximizing or minimizing values of functions of 
variables. 

Quadratic forms x'Ax may be considered as a special case of more gen- 
eral Hermitian forms x* Ax. In fact if A is real symmetric and the variables 
are constrained to be real, then the Hermitian form x* Ax becomes the 


Nen empresa rapuit i VEO AMERY SED 


quadratic form x'Ax. (If the matrix A is real symmetric and the variables 
are complex, then x* Ax is called a real Hermitian form.) 

Since in the field of control systems quadratic forms appear more fre- 
quently than Hermitian forms, we shall, in this section and the next, devote 
much attention to quadratic forms. Notice that results obtained for quadratic 
forms can easily be extended to Hermitian forms. 

We shall here give two definitions which we shall use in the subsequent 
treatment of the subjects: 


Rank of quadratic form (or Hermitian form) A quadratic 
form x'Ax (or a Hermitian form x* Ax) is called of rank r if A is of rank r. 


Diagonal quadratic form (or diagonal Hermitian form) A 
quadratic form x’Ax (or a Hermitian form x*Ax) is called a diagonal 
quadratic form (or a diagonal Hermitian form) if A is a diagonal matrix. 

The subjects to be discussed in this section are diagonalization of real 
symmetric matrices, Sylvester’s law of inertia, definiteness, semidefiniteness, 
and indefiniteness of real quadratic forms and Hermitian forms, and simul- 
taneous reduction of real quadratic forms to diagonal quadratic forms. | 

We shall now consider a theorem on diagonalization of real symmetric 
matrices. 


Theorem 5-7 Every n X n real symmetric matrix A can be 


expressed as 
A = TDT' = TDT- 
where 


0 An 
and T is an orthogonal matrix. Also A* can be expressed as 
A*t = TDT = TD: T~ 
where k = 1, 2, 3,... 


Proof A proof of the first half of this theorem was given in 
Prob. A-3-3. Here we shall give a more direct proof. Let us define the nor- 
malized eigenvectors associated with n eigenvalues of the matrix A to be 


Xj, Xo, -- +5 Xn 
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namely, 
(Xi, Xi) = l; (Xi, Xj) =0 
(nje-132...-) 


Q Jj) 


Let us form the matrix T whose columns consist of the vectors x,: 


Xn Xa e Xia 
Xor Xa 000 Xon 
Tq | * 
Xni Xns T PPS Xnn 
then 
1 0 
1 
T = 
0 l 


(T is an orthogonal matrix, or T' = T-'). Notice that since (A — XjI)x, = 0 
we obtain 


ÀiX tt aX in Xi 0 
AXo -4e Xa Xen Xs 
AT = . . esp 
nM ttt AnXnn 0 Xn 
and i 
Ai 0 
Ao 
T'AT = 
0 As 
Therefore 
Neo 
A=T T' 
0 Jus 


——————M m Ó— M OTN 
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or 
A-—TDI —TDI- 
Once the first half of the theorem is proved, then the second half can be 
proved immediately. A? can be written as 
A? = (TDT^ (TDT’) = TDT' TDT' 

Since T T— I 

A?’ = TD? T’ = TD'T-! 
Similarly, 

A? = (TD? T) (TDT) = TD’ T = TD' T^ 

We thus obtain fork = 1,2, 3, ... 

At = TD* T' = TD* T-! 
This completes the proof. 


Diagonalization of quadratic form From Theorem 5-7 it is 
clear that by choosing a set of orthonormal eigenvectors of A as the column 
vectors of T, the transformation x — Ty reduces the quadratic form 


n n 
Q=wNAx= 5 2 Gij Xi Xj (ai; = a5) 


i=l j= 


to 
Q = yTATy = XX 
In the following we shall consider an example. 
Example 5-1 Consider the following quadratic form: 
Q = 2x} + 22d + 4x9 + 2x, xX + 24/10 xix; + 24/10 xo xs. 


Let us find an orthogonal transformation matrix T that reduces Q to the following 


form: 
Q =y + Xsy$ + Ay 


where Aj, Az, and A; are the eigenvalues of the square matrix associated with Q. 
The quadratic form Q can be written as follows: 


2 1 4A/10][x] 
O= Dx.xs xs] 1 2 A101]! x, 
A10 4/10 4 lx 


XX 
where 
X 2 1 4/10 
s= xalh A=) I 2. w/10 


Notice that A is a symmetric matrix. The characteristic equation is 


2—2AX 1 / 10 
lA—M|-| 1 2-2 vO =ð 
NIO A10 X 


The eigenvalues are found to be 
A = —l, `=], A; = 8 
A transformation matrix can be determined from Eq. (3-18) as follows: 


Sean Ti 
5 7 14 
—2410 0 7410 


Each column of this matrix is an eigenvector of A. Normalizing the eigenvectors, 
we obtain the transformation matrix T as follows: 


5 1 2 
VE up 
5 1 2 
IA vr 
-2 9 wv 
3 3 
(Notice that T’T = I.) The transformation 
x — Ty 
reduces 
Q — x'Ax 
to 
Q — y'T'ATy — y'Dy 
=] 60: OT Fy, 
=[¥ » »]| 0 1 Ol] » 
00 8j||5 


= —y} +y + 8y 
In other words, by the transformation 


uA Ge) 1 2 
UU IB TE der xs 

ao Sd A2 
"m TER LA 


the quadratic form 


Q = 2xi + 2x5 + 4x9 + 2x, x, + 2/10 x, x, + 24/10 Xp x; 


Eyen 


can be reduced to 
Q = —y + Xx + 85 


Diagonalization of Hermitian form Diagonalization of Her- 
mitian forms can be carried out by essentially the same technique as that 
employed for quadratic forms. Since only minor modifications are necessary, 
we do not discuss it in detail, but merely give an illustrative example. 


Example 5-2 Consider the following Hermitian form: 
H = XQx, + 2X4x, + X3x5 
tO 2) T-j4 2)» 
FIS Xx, — j/ 3 XQx, 
Let us find a unitary transformation U that converts H into the following form: 
H =A, VY: Nas + M is 


where A, A», and A; are the eigenvalues of the square matrix associated with H. 
The given Hermitian form can be written as 


1 1+j/2 0 x, 
H —[X, X, xj] 1—j4/2 2 j^ 3 X» 
0 —j443 1 Xs 


Referring to Example 3-1, the eigenvalues of A are 
M=1 ——lh -—4 
The unitary matrix-U whose columns consist of the normalized eigenvectors of A 
was obtained in Example 3-1 as 
—j4/ 3 14+j/2 ltj4W2 | 
46 «id i5 


6 A/ 10 A/15 
The transformation x = Uy will transform H = x*Ax into 
1 0 OTH 


H = y*U*AUy = y*Dy —[j y: Js] |0 —1 Ol|» 
0 0 4/L», 


In other words, by the transformation, 


GENE usa DESEE y. 
Ex DA e Jm de J15 
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eee 3 
X» = Vio? * ps? 


a lm4E, JE, ue, 
em MR OU VIP upon 


the Hermitian form 
H = XX, + 2X4Xs + X3X3 
+ (Lt jf V2) 5x2 + (1 —j4/2)X: 
Tj 3 34x, — jV 3 X3X 
can be transformed into 


H = Ji Vi — Jays + 44s 


Theorem 5-8 (Sylvester's law of inertia) If a quadratic form 
X'Ax of rank m is reduced by a nonsingular transformation x = Ty to the 
diagonal form 


ayi + days + +--+ d ayy, + agua tere + Use p*a 


then the number p of positive coefficients £i ti -..,4, and the number 
q of negative coefficients ap, a4,,,, ...,a,., are uniquely determined by 
A. Moreover, p is equal to the number of positive eigenvalues of A and q is 
equal to the number of negative eigenvalues of A. 


Proof Let us define 


Q, — a,yt + c o ay + pipri F tt + Gao Lea (5-20) 
Consider a nonsingular transformation x = Sz which reduces x’Ax into 

Q: ec biz pe ite eT b.zi + Dries ape eer D esZies (5-21) 
where Di, bo, ..., b, are positive and brey Drs, ..., b, are negative. We 


shall first prove that p = rand q = s. 
Since p +q = rank A =m, we have p+q=r-+s. Since the two 
forms Eq. (5-20) and Eq. (5-21) are obtained from the same x'Ax, we have 


TS 


p p*q T 
p ayi +È ayi= 2 bzi +} bz 


t=p+l i=r+l 


(5-22) 


Since y and z are related to x by 
yo Tx 
z—S-x 


respectively, Fis Ys, o» «y X, and zi, ‘Zips cavers 
functions. of 2; xe, os «vd 


, Zn can be expressed as linear 


MATRIX ANALYSIS 269 


Suppose that p >r. Then a set of n — p+r linear homogeneous 
equations 
y-0 (ptl<ign) 
z; — 0 0 «j«r) 
in the variables x,, X2, . . . , Xn possesses a nontrivial solution 
X = Xj 
This is because 


number of equations =n — p +r 
<n 
= number of variables 


Define 
Yio 0 
J20 i 
0 
y-—T'!x,—| Yoo |; Z = Sx = | Zr+1,0 
0 Zre2,0 
0 éno 


From Eq. (5-22) we have 
TAS x p+aq * 
San- È at= hbd- D aX 
i=l i=r+l i=1 i=p+l 
But the right-hand side of this equation is equal to zero, since z, = 0 G = I, 
2,...,r)and y; —- 0(j 2 p + Lp 4- 2,...,p +q). This is a contradiction, 


since 
Jio Zr41,0 


+ 0, . +0 


Ypo Zno 
Therefore, p <r. Similarly, we can prove that r < p. Hence p = r. Since 
p +q =r + sit follows that q = s. Hence we have proved that 
| pr and g=s 
We shall next prove that p is equal to the number of positive eigenvalues 
of A and q is equal to the number of negative eigenvalues of A. Since A has 


rank m there are m nonzero eigenvalues. Let us define positive eigenvalues as 
Ay, Ao, <- . , A, and negative eigenvalues as 2,44, Anse, -- +> Anse (A + km). i 1 
By Theorem 5-7 there exists an orthogonal matrix T such that the trans- į ^, 


formation x — TÉ will reduce x'Ax into the following diagonal quadratic | 
form: 


^ 0 f E 


x'Ax = Ẹ'T'ATẸ = Ẹ' 


enan a a 
- 


Hence 
X'Ax = AEE t +++ + ADE + nar Eher + c ufi 
(h + k =m) 


Because of the uniqueness of p and q which we have proved, we obtain 


then 


X; 0 
Ae 
P'AP = D'T'ATD = D' ee D 


p=rh and qug =k 


This completes the proof. 
Notice that the reduction to the diagonal form is not unique. The elements 
in the diagonal matrix T'AT are not unique, except that the number of 


positive elements and that of negative elements are uniquely determined 
by A. 


Theorem 5-9 A quadratic form x’Ax of rank m can be re- 
duced by a real nonsingular transformation x — Py to the diagonal form 


Node ec PS an 77 Jp 


where the numbers p and q are, respectively, the number of positive 
eigenvalues and that of negative eigenvalues of A; and p + 4 = m. 


Proof Since A is a real symmetric matrix, the eigenvalues 

Ay, Ae, ... , An are real. Let us assume that eigenvalues X,, A», ..., Ap are 

positive real, X544, X545, - - - , X544 are negative real and X54444, Aptates - - -> 

An are zero. Let T be an orthogonal matrix such that T'AT = diag (Ay, A», 
. , Xn). Let us define P by 
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Hence we obtain 
X'Ax = y’ P'APy 
=e XX 7 X 
This proves the theorem. 


Signature of quadratic form x'Ax (or Hermitian form x*Ax) 
The number p — q is called the signature of the quadratic form x'Ax (or 
Hermitian form x* Ax) or of the matrix A. 
Notice that for two real symmetric matrices A and B to be similar, it is 
necessary and sufficient that the ranks of A and B be the same and the 
signatures of A and B be the same. 


Definiteness, semidefiniteness, and indefiniteness of quadratic 
form The quadratic form 


y» is de ho Jia. 535 Dene ON cee F e 0y; 


1 Positive definite if p = n, q = 0 

2 Positive semidefinite if p < n, q = 0 
3 Negative definite if p = 0, q = n 

4 Negative semidefinite if p = 0, q < n 
5 Indefinite if pq + 0 


Definiteness, semidefiniteness and indefiniteness of Hermitian 
form Since the Hermitian form yields real values, it can also be classified 
as positive (negative) definite, positive (negative) semidefinite, and indefinite. 
A Hermitian form x 


Ji t c IP — Joross — 057 — JssaYpsa + ÜYpeasiYpsai 
=- eee + Oyays 


1 Positive definite if p = n, q = 0 

2 Positive semidefinite if p < n, q = 0 
3 Negative definite if p = 0, q = n 

4 Negative semidefinite if p = 0, q < n 
5 Indefinite if pg + 0 


A real symmetric matrix A or a Hermitian matrix A is called positive 
(negative) definite, positive (negative) semidefinite, or indefinite, if the 
corresponding real quadratic form x’Ax or the corresponding Hermitian 
form x*Ax is positive (negative) definite, positive (negative) semidefinite, or 
indefinite, respectively. 
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Notice that a real symmetric matrix A or Hermitian matrix A is positive 
definite implies that A, > 0, 34 > 0,...,An > 0. Therefore, if A is positive 
definite, then 


n 
|Al= [Ja >0 

i=l 
Thus we find that the determinant of a positive definite matrix A is positive. 
Similarly, we find the determinant of a negative definite matrix A is negative 
if nis odd, positive if n is even; and the determinant of a semi-definite 
matrix is zero. Hence a positive or negative definite matrix A always has an 
inverse. 

Notice that if A, is an eigenvalue of A, then Aj! is an eigenvalue of A-, 
[Refer to Eq. (2-6) on page 77.] Hence A being positive (negative) definite 
implies that A-' being positive (negative) definite. 

Notice also that if x'Ax (or x*Ax) is positive definite and x = Ty is 
a nonsingular transformation, then y'T'ATy (or y*T*ATy) is also positive 
definite. Therefore. A being positive definite implies T'AT (or T*AT) being 
positive definite and vice versa. 

We shall next present a theorem on Hermitian forms. 


Theorem 5-10 If a Hermitian form x*Ax of rank m is re- 
duced by a nonsingular transformation x — Ty to the diagonal form 


a iyi c + apoyo + Apri V pei Yer + `+: + ApsgVp+aV +a 
then the number p of positive coefficients a,, as, ..., a, and the number 
q of negative coefficients 45.;, 4545, +++ ; Ap+q are uniquely determined by A. 


Moreover, p is equal to the number of positive eigenvalues of A and q is 
equal to the number of negative eigenvalues of A, and thus p + q = m. 


Proof We shall leave the proof to the reader, since it is quite 
similar to that of Theorem 5-8. 
We shall next present a theorem on positive definite real symmetric 
matrices. 


Theorem 5-11 If A is an n x n positive definite real sym- 
metric matrix, then there exists a nonsingular matrix S such that 


A = SS' 


Proof It has been proved that there exists an orthogonal 
matrix T such that 


T'AT = D = diag Q4, Aes ..., An) 


Since A is assumed to be real symmetric and positive definite, all the eigen- 
values Ai, A», .. . , An are positive. Hence 


diag (V Nrs As, sise , An) 


is a real diagonal matrix. We can therefore write 


~N O1[ AX, 0 
AP Ss AA 
A=TDT' =T . ; T' 
0 A Xn} LO AV Xn 
= SS’ 
where S = T diag (/X,, Xs, .--, A] An) 


This completes the proof. 


We shall now consider simultaneous reduction of quadratic forms into 
diagonal forms. 


Theorem 5-12 If A isan n xn positive definite real sym- 
metric matrix and B is ann x n real symmetric matrix, then there exists a 
nonsingular matrix S such that 


S'AS — I 
and 
S'BS = diag (u,, 1, .. . , Hn) 
where Hi, Ho, . . . , Hn are the eigenvalues of |B — LA | — 0. 


Proof It has been proved that there exists an orthogonal 
matrix T such that 


T'AT = D = diag (A,, Ae, .- - p An) 


where the 2, are the eigenvalues of A. (The, are all positive real.) Let us 
define 


Q = T dias (Fe...) 


then 


=I 


Since B is real symmetric, Q’BQ is also a real symmetric matrix. There 
exists an orthogonal matrix Z such that Z'(Q'BQ)Z is diagonal. The diag- 


onal elements are the eigenvalues of Q'BQ and are the roots of |B — yA} 
— 0, since 
Q'BQ — pI = Q'BQ — uQ'AQ = Q'(B — LA)Q 
and | Q| + 0. Hence 
Z'(Q'BQ)Z = diag (Hs, Ha, -- - , Hn) 
Since the matrix Z is orthogonal, and Z'Z = I the desired matrix S is given 
by S — QZ. With this matrix S we obtain 
S'AS — Z/Q'AQZ — Z'IZ — I 
and 
S'BS = (Z'Q^) B(QZ) = diag (1, Ho, -~ -> Hn) 
This proves the theorem. 


Notice that the roots of |B — pA | = 0 are all real, since they are also 
the roots of |Q’ BQ — „I| — 0 and Q’ BQ is a real symmetric matrix. 
Example 5-3 The kinetic energy T and the potential energy V of 
à dynamic system are given as follows: 
T-53;—4x,X,-- 22i 
V = 2x? — 4x, X; + 6x; 
By use of a suitable transformation matrix S reduce the quadratic forms of the 
kinetic energy and potential energy into diagonal quadratic forms, or 
T=} +} 
V = ayi + ay; 
where a, and a, are constants. Obtain Lagrange's equations of motion in terms of 


y, and y,. 
T and V can be written as 


romali E-e 


2 ANEY 
2 —2 ; 
V = [x, x 5 " [z] = x'Ex 


Notice that A and B are real symmetric and both are positive definite. The eigen- 
values of A are A, = 1, X, = 6. The eigenvectors of A are found as 


a 2b 
P 
where a and b are constants. The orthogonal transformation matrix T is then 
obtained as 


2 

E : 

alan 1 8 1 
S wx. 
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Then Q is found as 

: 1 1 

=Td 

Se Uk IE om) 

1 2 

_ a 5 A/30 
2 d 

A5 A/30 


and Q'BQ becomes 
18 14 
5 A/150 
UN 
m. 150 15 


The eigenvalues of Q'BQ are the roots of | B — uA | = 0. Solving 


Q'BQ — 


2— 5u —2+2p 


|B — pA |= = 
| —2+ 2p 6 — 2yu| 


we obtain 4, = 4, xa = 4. Once the eigenvalues of Q'BQ are found to be 1 and 4 
the transformation matrix Z is obtained as follows: 


/ 6 7 
A/55 A/55 
_V/ 6 
A/55 A/55 
The desired transformation matrix S is given by 
4 1 
J/66 Vil 
1 3 
A/66 WT 
With the transformation x = Sy, the quadratic forms T and V are reduced to 
T = X'Ax = y'S'ASy = yt + y 
V = x'Bx = y'S'BSy = } y? + 4? 
The Lagrange's equations of motion are 
a (=) —$L -o 
dt Ny; Oy 


where L =T — V = Lagrangian 
For the given system we obtain 


S=QZ = 


(i = 1, 2) 


V+ 5y, =0 
Jo + 4y. =0 


which can be solved immediately. The solution in the original coordinates x, and 
x, is given by 
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_ 4 o 
a een US 


56g at — as Hh. 4 y 
2 = 66^! Jii? 


5-5 SYLVESTER'S CRITERIA FOR DEFINITENESS AND 
SEMIDEFINITENESS OF QUADRATIC FORMS AND 
i HERMITIAN FORMS 


In this section we shall first prove Sylvester’s criterion for positive definite- 
ness of quadratic forms. We shall then present Sylvester’s criteria for negative 
i definiteness and for positive or negative semidefiniteness of quadratic forms. 
Finally we shall state that Sylvester's criteria for definiteness or semidefinite- 
ness of quadratic “orms can be applied to Hermitian forms. 


Theorem 5-13 (Sylvester's criterion for positive definiteness of 
quadratic form) A necessary and sufficient condition in order that the 
quadratic form x'Ax, where A is an n X n real symmetric matrix, be positive 
definite is that the determinant of A be positive and the successive principal 
| minors of the determinant of A be positive, or 


ai Aye 


a > 0, > 0, Asi As d$, | > O, 
Azi Az 
43, Q39 33 
Qa Ay cc Ain 
Qj Qag ++ Gon 
det A = >0 (aij = aj) 
Any Qno *** Am 
Proof Assume that the quadratic form x'Ax is positive def- 


inite. In the following quadratic equation 


m m 
X'AX = 2 2 Aij XiX; (a, = a) 
i=l j= 
let us put 
| Xera = Xes St m XQ —0 [ 
j Then 
] k k : 
b > QGuXPX; (a7; = sr) 


r=ls=1 


is also positive definite. The coefficient matrix in this case is given by 


GU E E Aik 


If A, is positive definite, then | A, | > 0, or 
Gy =" Gig 
>0 


Hey i509 y 


Hence, we conclude that if the quadratic form x'/Ax is positive definite, then 
the successive principal minors of the determinant of A are positive. 
We shall next prove the converse; namely, we shall prove that if 


lA|-0 (=1,2,...,n) 


then x’Ax is positive definite. We shall use induction. For k — 1, it is clear. 
Assume that the result holds for the (k — 1) x (k — 1) matrix A,_,. Let us 
write the k x k real symmetric matrix A, as 


which can be rewritten as (refer to Prob. A-5-14): 


ke t] j pai a | EE 
AS. I 0 | ak — a'Azaljl0 ! 1 


From this we obtain 
[Ac] = | Ax- | (a — a'Az';a) 
Since by assumption | A,| > 0, and | Ax- | > 0, we obtain 


Qu — aAz',a>0 
Then 


+ (24 — a’Az!a)xt 


Xk-1 


Since A,_, is assumed to be positive definite and a,, — a’ Al, a > 0, we 
conclude that 


As- | 0 
x! | -----+------------- x>0 for x Æ 0 
0 'a,,—a'Aj;la 
Hence, x’Ax is also positive definite. This proves the theorem. 
We shall next present Sylvester's criterion for negative definiteness of 
quadratic forms. 


Theorem 5-14 (Sylvester's criterion for negative definiteness of 
quadratic form) A necessary and sufficient condition in order that the 
quadratic form x'Ax, where A is an n X n real symmetric matrix, be negative 
definite is that det A be positive if n is even and negative if n is odd, and the 
successive principal minors of even order be positive and the successive 
principal minors of odd order be negative, or 


E m 
a, « 0, á i [>o asy As a |<0, --- 
Q5, Are 
43; Az Azz 
det A > 0 if n = even 
det A <0 H n = odd 


(aij = a5) 


Proof The quadratic form x'Ax is negative definite if 
X'(—A)x is positive definite. A necessary and sufficient condition that 
x' (—A)x is positive definite is given by 


—a, aa “iy 
—a —a 
—a,, > 0, | s * > 0, —d, —Qx2 —GQ;3|>0, --- 
—0» —Are 
=a As; ~G 
det (—A) > 0 
It follows that 
A Qi, i 
a a 
a, <0, | - i 2x0, an An a |<0, --- 
Az Are 
Azı 03) Q33 
det A > 0 if n = even 
det A <0 if n=odd 


Hence the theorem. 
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We shall next present Sylvester's criteria for semidefiniteness of quadratic 
forms. 


Theorem 5-15 (Sylvester's criterion for positive semidefi- 
niteness of quadratic form) A necessary and sufficient condition in order 
that the quadratic form x'Ax, where A is a real symmetric matrix, bc 
positive semidefinite is that A be singular (det A = 0) and all the principal 
minors be nonnegative, or 


Aii Qij Gi 


aii Aij 
Aii = 0, > 0, Aji Aj; Ajk > 0, ET 
aj dj 
Aki Qj kk 
Qu Ayo Qin 
Qo, Az Gon 
det A = : =0 (ai; = aj) 

Gn, Ano al aaa Ann 


Theorem 5-16 (Sylvester’s criterion for negative semidefinite- 
ness of quadratic form) A necessary and sufficient condition in order 
that the quadratic form x'Ax, where A is a real symmetric matrix, bc 
negative semidefinite is that A be singular (det A = 0) and all the principal 


minors of even order be nonnegative and those of odd orders be non- 
positive, or 


Aii Qij Aik 


Aii Aij - ; 
aii € 0, > 0, Aji Ajj đjk < 0, 
aj; Ajj 
Aki Any Akk 
Ai Ai Gin 
A21 Azz Aon 
det A = . =0 d; =d 
ij ji 
Any Ang c0 Am 


Proofs of the last two theorems are similar to those of Theorems 5-13 
and 5-14. We shall leave proofs to the reader. 

As an example of application of Sylvester’s criteria, consider the fol- 
lowing quadratic form: 


10x] + x3 + 4x3 + Ax, X, + 6x,x; + 2x,x, = x'Ax 
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where 
10: 2 3 
A=|2 1 1 
3 14 


For this matrix A, det A > 0 and the successive principal minors are positive, 
or 
10 2 


10> 0, | 21 


2 

| > 0, 2 4 
3 | 

Hence x’Ax is positive definite. a 
As an another example, consider the following quadratic form: 


xi — 3xi + x + 4x, x, + 8x,x3 
1 


The matrix A associated with this quadratic form is 


1 2 4 

A=|2 —3 0 

4 0 1 
Since | 
i o dà ] 
1 2 | 
16; l; MED 2 —3 0|20 | 

A d ui 7 


x'Ax is neither positive definite nor negative definite. Since det A > 0, A is 
nonsingular and cannot be semidefinite. Hence we conclude that A is inde- 
finite. 

We have presented Sylvester’s criteria for quadratic forms. However, 
the same criteria can be derived for Hermitian forms. Theorems 5-13—5-16 
hold true for Hermitian forms with trivial modifications that x'Ax be 
changed to x*Ax, A be changed from a real symmetric matrix to a Hermitian 
matrix, and a,; = aj; be changed to aij = à; 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-5-1 Obtain the minimal polynomials of the following 
matrices: 


110 
(0 |010 | 
002 | 


Solution 
(1) (A — 12(4 — 2) =0 
(2) ~—-1=0 


Problem A-5-2 Prove that any two polynomials in an n X n matrix 
A commute, that is, 


IAEA) = g(A)/(A) 


where 


fA) = X aA‘ and  g(A)— $ b, A 


Solution 


SAA) = ($ aA‘) (È bA’) 


1-0 J-0 


T s 
— X X a AibjA = Y, Y, a,b, Ate 
jJ=0 1207-0 


120 


- (X 5A (X, aA‘) = 20/0) 


Problem A-5-3 Prove that the minimal polynomial (à) of an 


n X n matrix A and the minimal polynomial $4., 44(A) of T-'AT are the same. 
(T is a nonsingular matrix.) 


Solution Notice that 
$4(T"'AT) = T [$b (A)T = 0 
Hence 6, is divisible by $y., ar- Since 
A = T(T^AT)T-! 
we have 
$y-1at(A) 
= $r-ıar [T(T'AT)T-'] 
= T[ġr-:ar(TAT)]T- = 0 


Hence $«.,4T is divisible by $,. Since the coefficients of the highest-degree term 
in both $4 and $4, 44 are unity, we conclude that 


rar) = 640) 


Problem A-5-4 Suppose that X,, Xz, - - - , Àm are distinct roots of 
the minimal polynomial of an n X n matrix A and that the minimal polynomial of 
A is a product of distinct linear factors. By solving the following equation for 


f(A): 


1 Y | I 
x Xs Ami A 
MOM Ah A 


AS Am wat Am-! Am-! 
SAD SA) +++ fO) F(A) 


we can obtain 
- (A —X]D +++ (A — Age) (A — XD ++ (A — Anl) 
= (A — AyD) --- VÀ — AA Bet) T 
fe) E/o0 Qu — Ay) *-* Qa — Ne) Ae — e) t On — Am) 
Verify this for m = 4. 
Solution By expanding the following matrix about the.last row, we 


obtain 
bti 1 1 I 


X Är — 4 — 5 A 


Asla uM M M. 
A3 Ai Ai Ai A3 
fA) fA) FAs) fA) F(A) 
i 2 1.3 & 4$ Y d 
As A. Meo A. Ae A A 
Ju» M Lu ns T 
az raz at az x? al mw A 
NF oD M Ai AP 2S) oM A3 
yop Be 4 ty d ^73 
Oe ee A chee Je. A 
= AL X 
+409) 33 ag ag ar (LO ] ag ag at at 
NON AZ AP NNA 
1 1 1 I 
Aw Xa A 
FO a 4 


AG M X A 

rE ab AF AP 

Since Vandermonde’s determinant has the property that 
1 | se d 
Ar Ap eee Na 
Ab d TE A 


IKicj&m 
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we obtain 
E ob X 1 
Xu Ung) Ay A 
X: A2 M Xl = (A, I Aa) Aa Pes Aa) Aa = AAs = A2) (Az = Ai) Ae E i) 
Aj AR OM M 
and 
I] d$ d$ T 
Ai Ay X. 


Moat ag az] A MDA — ADA — 00,4 — 1304 — A904 — A) 


ri AS AZ CAB 
Therefore, we obtain 
A= f(A) IA, — A) Ag — AVDA, — 04 — Ae) Aa — A) Ae — 24)] 
A0) KA — X,D(A — MAINA — AINOA — Aa) As — X))04 — 34)] 
FSAA — X,D(A — A,I)(A — MD Ay — A) Ou — A) Ar — 24)] 
~ £0) [(A — à D(A — A,D(A =A A m As) (Ag = AAs — ,)] 
aie (A — X,D(A — ADA — ADA, AVA, — ANA — 4)] 
Solving for /(A), we get 
f(A) =a) A= AWA — AIA — AD) (A — XiD(A — X — 
, Ea —2A904 —A)0. —AQ 5409 A: — XU; = he = x 
+ f(y) A= X,D(A — A.D(A — A4I) (A —A D(A — A.D(A — 
_ | Os = APs = A90 — X). 709 ERG C MUR XE 
=E fa) (A — àI) --- (A — Ace) (A — Ape) --- (A — Aml) 
kel (X — Xi) SE (Ax — X4,)0. — Nesi) oes (Ak — X4) 
where m — 4. 


Problem A-5-5 Prove that, for i 
, . ; an n X n constant matrix 
series defining the exponential of At, namely, me 


At — Ls 1 
e D At Fay AME une de rr APER de ius 


is convergent for all A and for all finite b 
Solution Let 
At = (a;;t) and max|a;t| = M 
Put 7 
Amm = (at)? 
then we can prove that 
l(a;0) ^| n- M^ (If in;l Kj«n) (5-23) 


To prove inequality (5-23) let us use inducti - 
; x uction. Clearly th al 
inequality (5-23) is true for m — 1, then y this is true for m = 1. If 
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|(a;;t)7?? | < n™-2 M ^-1 
Hence 


1 (au t)7?a,5t 


M= 


|(a;t)™ | = 


3 
a 


< È | aint)" |] ast] 
Ln nm-? Mn-1M =n! M™ 
Therefore inequality (5-23) holds for m. Since for all finite M 


MMC e | 
— e = Fn 1M 
n mzo mn. 


converges, we see that the series 
= A. (a; ty 
Aim? 

also converges for all finite t. Therefore, the series 


Tt At du AM e ee 4e d ARR E ee 


converges for all constant matrix A and for all finite ¢. 
Problem A-5-6 Show that 
(| (A9*-(A9*  (k—12,...) 
(2) (e^9* 2n e^" 
where A is an n X n constant matrix and ż is a scalar. 
For (1), note that (A*)* can be written as 
(A*)* = Af A7 Af +++ AF AE 
where the subscript identifies the mth matrix A. Since the conjugate transpose of 
the product of matrices is equal to the product of their conjugate transposes in the 
reverse order; that is, (ABC)* = C*B*A* 
(A*)* = (Ay Age, +++ As A A,)* 
= (A*)* 
Thus we sec that (A*)* = (A*)*. 
For (2), e^' can be written as 


Solution 


a (At)* 
Az (At)* 
e I+ "3i 


Then, taking the conjugate transpose of both sides of this last equation 
e (At)^* 
e» - (829) 
k=1 . 


q SAE 
to ki 


but part (1) showed that 
(A*)* = (A*)* 


Hence, 


(e^)* =I + = B pare 
Problem A-5-7 Compute e^! where 
x 1 0 
A=]j0 A 1 
00 x 
Solution 
A. I 0 i 0 0 O i 0 
Or» 1|—X|O 1 O1+/0 0 1 
00 x 0 0 1 000 
Since the matrices on the right-hand side of the preceding equation commute, 
X, 1 0 AE O0 O 0 d£ 0 
exp|O à 1|r24exp|O At O|exp|O O ¢ 
002 0 0 Mm 000 
Since 
0:0 1 f ic 
exPT0: 0 =o i1 t 
000 00 1] 
we have 
» 10 |!» ous 
explo x 1|r-e*|o 1 ^ 
~ 0 X G A 
Problem A-5-8 Compute e“ where 
» 10 s 0 0 
0 X 1-00 
"S So mo ; i 
000 Xv 
0 0 0 0 x 
Solution Let 
At 0 0:1 0 +--+ O 
At o 0 £z 0 
Bt = j G.M * 
0 At 000 0 


Then we obtain 
e^t — gBteCt 


= mra Ct + AC + cm (cry | 


t? [n2 
3i ^ Q= 


L f 


pr-2 


Te eo 


00 0 vs 1 
00 0 =. 1 
Problem A-5-9 The derivative of | A(r)| can be given by. 


—€— rw 
arl ^07 2 2; Aug au 


Since 
1 
AW) = AG» 
we have 
1 d|A()].. ER Alt 
TAG) at tr( ME ()) 
By use of this relationship show that, for a constant matrix A, 


le^|— er^ 
(Hence |e^| = 1 implies that tr A = 0.) 
Solution Let A(t) — e^'. Then 


d a id gh At — eAt A 
ar A0 = ae = Ae e 


and 
1 dje*| _ 
pm a ^ 
Hence 


le^! = Ce crx 
By substituting z = 0 in this last equation, we find C = 1. By letting ? = 1, 
le^l- e^ 
Problem A-5-10 For a real symmetric matrix A with nonzero eigen- 
values Aj, Ae, ... , Àn Show that 
x 0 
Ag! 
A`! = $ T' (T — orthogonal matrix) 


0 Az! 
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Solution By Theorem 5-7, we may write 
T'AT = D = diag(,2,,..., An) 
Then, ; 
EAD = TASTY)! = D^! 
and we obtain 
TT AUT) T A7! — TD-'T’ 
Problem A-5-11 For a positive definite matrix A there exists an 
orthogonal matrix T such that 
f T'AT = D = diag (dj, da, ..., dp) 
where the d, are all positive. The matrix T defines an orthogonal transformation 
of coordinates reducing the quadratic form, x'/Ax = constant, to its axes. Show 
that if we put 
T'A"T-TI 
then 
DI = diag (1, 1, ..., 1) 
(Note that 


T =D = diag (Jogoo) 
is also positive definite.) 
Solution 
DT —T'ATT'A^T = TAAT = I = diag (1, 1,...; 1) 
Problem A-5-12 Show that if a scalar quantity f is defined by 
fx'Áx 
where A is an n X n real symmetric matrix, then 
grad f = 2Ax 
Solution If we write f in the following form: 
Anu dde ct Ain} | 
Qn, Gog >>- Aan Xa 
Sf = [xi x2 e Xa] 


Ani Ang aai Ann Xn 
aX + aixa + + + aX 
dp X; + aX Rott + do Xn 
= [xi X -++ x, 5 - 
Any Xi + Qao Xo T +++ + AnnXn 
= (aiXi + aX + +++ + AinXn) Xi 
+ (an Xi + AggX_ + +++ + Gon X4) Xs 
+ 


+ (Gn X1 + AngXe + *** + GnnXn) Xn 


MATRIX ANALYSIS 289 


where a,; = aji we find 


gr = (auxi + di Xo tere + GinXn) 


Ox; 
(Dex Dy Zire sies) 
Hence 

A Aan a Ain x; 

1 
gf Qo, Ag °° Gan X 

gadf-| 72). . BINMLE 

A Ani An? °°" Ann Xn 

n 


Problem A-5-13 The kinetic energy of a system is given by 


n n Se af 
2 2 Qi;diq;j 


t 


1 
ams 


J 
where the a;; are constants or functions of the q; and a;; = aj Show that if p; is 
defined by 


- T 
Pi eji 
then 
ie HE 
gi aD; 
Solution The kinetic energy T can be written as 
T = id'Aq 
where 
ji ay, Ay ct Ain 
d: Az deg *** Gan 
In Ani Gag 777 Ann 


Since T = 4q’Aq and p, is defined by 
eT 

pi — 2d. 

we obtain (see Prob. A-5-12) 
p = (grad T); = Aq 

Hence 

q — A"!p 
(Since the kinetic energy T is a positive definite quantity, A is nonsingular. Hence 
A^! exists.) We thus obtain 


T = $d'Ad = $(A^'p)'A(A^!p) 
= $(A"!p)'p = ip'A7!p 


Therefore, 
(grad T), = A^!p 
Hence 
q = (grad 7), 
or 
E oT 
gue ap: 
Problem A-5-14 Show that if 
Ani i à 
A = |-----4+--- 
af pg 
then 


- E] Iz PN AS | = 
a'Ai;11]L 0. Jama Azalt 1 | 


I i 0 -1j a An-1 | a 
ye ae | eee: — = | ----- bes 
a'A;, H 1 0 Ann — a'A;a a' i Ann 


Problem A-5-15 Show that the following quadratic form: 


—xi — 10x} — 4x$ + 6x, X, + 2X2 x 
is negative definite. 
Solution The given quadratic form can be written as 
a d. rx 
XAx = [x, x, x,]| 3 —10 1 Xy 
0 1 —4j]lx, 


The successive principal minors and the determinant are 


=] 3 0 
—] 3 
=i <0; > 0, 3 —10 1|«0 
3 —10 
0 1 —4 


By Theorem 5-14, x’Ax is negative definite. 


PROBLEMS 


Problem B-5-1 If the minimal polynomial of an z x n matrix A 


where 


A, 10 
A = |----+---- 
0 | As 


is defined by $4(X), then show that $,(A) is the minimum common factor of 
$4, (X) and alà), where $4,(X) and $4,(X) are the minimal polynomial of A, 
and that of A,, respectively. 


then 


Problem B-5-2 Show that if || A || < 1, then 
X A --—A)! 
i=0 


Problem B-5-3 Show that if 


01000 

00100 

A=|0 001 0 
00001 

lad 60 0 
00100 0 0 
00010 0 0 
A?=/0 0 0 0 1|, At=/a O 
a 0 0 0 0 0 a 
0a000 0 0 


ooont on coco 


x 
ll 
o A 
Q 
Q. 
ts) 

n , © 
E ES 
e 
ll 
=e | 
a oococeo°eo 

LI 
Oo Q Oo OnR 


oon 000000 ^- 
oo A OO fo Oo © O = Oo 


2 
p 


a? 


a? 0 
a a’ 
Al = a , AN = a 


0 a? 0 a 


Notice that similar results hold for any n x n matrix A of the following form: 


OL O me @ 
ó à 1 x 6 
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Problem B-5-4 For a skew symmetric matrix A, show that 
(gi Nu or (e^^5' (e^^) — I 
Problem B-5-5 Consider a square matrix A defined by 
A = diag (A,, As, ..., Am) 
where the A; are square matrices of orders r; (i = 1, 2,..., m). A? is then given by 
A? = diag (Aj, Aj,..., AX) 
and similarly 
A* = diag (A5, Ak... , AX) (k-—1,2,....) 
Hence we can obtain 
F(A) = diag (/(A), f(A), . -- -f(An)) 


where f(A) is any polynomial in A. Notice that if f(A) is an infinite series, it con- 
verges to a limit if and only if all the f(A;) converge to respective limits. 
By use of the foregoing relationship obtain sin A and cos A where 


ay, 0 


0 ann 
Verify that 
sin? A + cos? A = I 
Problem B-5-6 Prove that if the eigenvalues of an n x n matrix A 


are Ay, Ao,..-, An then the eigenvalues of A* are A5, A5... , A5, where k is an 
integer. 


Problem B-5-7 An n x n matrix A has eigenvalues X4, s... , An 
which are not necessarily distinct. Show that there exists a matrix T such that 


Ay 2: 
Xs 


0 Xn 
where + indicates that the elements above the diagonal are not necessarily zero. By 
use of this show also that 
3 M = tr (A*) 
i-1 
Problem B-5-8 Show that for a symmetric matrix A = (aj) 


2 
Ox, Ox, x’Ax = 2a; 
Problem B-5-9 Suppose that a nonsingular matrix P transforms 
a real symmetric matrix A such that P'AP = I. Show that 


T T rol —S ra 


i 
| 
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IP = 


where the à; are the eigenvalues of A. 

Problem B-5-I0 Prove that a Hermitian form x*Ax, where A is 
an n x n Hermitian matrix having n distinct nonzero eigenvalues Aj, Ag, ..., An 
and x is a complex n vector, can be reduced to the diagonal form : 


MP1 + Mysys + +++ + AnVaYn 
by a suitable unitary transformation x = Ty. 
Problem B-5-11 Find a suitable transformation matrix S such that 
x = Sy will simultaneously convert the following two quadratic forms: 
Q, = x'Ax = 82x] + 4.8xixs + 6.8x3 + 3x3 
Q, = x'Bx = x} + 2x3 + 3x3 + 2x, x3 + 2x, x3 + 2xsxs 
into 
Q; — yl +y 9 X 
Q, = ayi + byi + cy} 


where a, b, and c are constants. (Notice that the eigenvalues of A are 3, 5, 10, and, 
therefore, Q, is positive definite.) 
Problem B-5-12 Prove that for a real symmetric matrix A, the 
following statements are equivalent: 
(1) A is positive definite. 
(2) The determinant of A is positive and the successive principal minors of the 
determinant of A are positive. 
(3) All eigenvalues of A are positive. 
(4) There exists a nonsingular matrix B such that 
BB=A 
Problem B-5-13 Suppose that the elements a;; of an n X n matrix 
A are non-negative. Prove that a necessary and sufficient condition that the absolute 
values of the eigenvalues of A are all smaller than unity is given by 


] Gi — 412 tee —awx 
=y led des — ok 
>0 
— ak — xk» 1 — akk 
(A —1,2,...,n) 


—,—,1———7 


Obtaining the Solution of 


the State Space Equation 


6-1 INTRODUCTION 


State space representations of linear dynamic systems are very frequently 
in the form of linear vector matrix differential equations. The objective of 
this chapter is to present standard techniques for solving such linear vector 
matrix differential equations.' For numerical solutions by the use of digital 
computers, such linear vector matrix differential equations must be discre- 
tized. When discretized we will have vector matrix difference equations. 
Formal solutions of linear vector matrix difference equations are quite 
analogous to those of linear vector matrix differential equations. We, there- 
fore, include discussions of formal solutions of linear vector matrix difference 
equations in this chapter. 

The following sections of this chapter may be outlined as follows: 
Section 6-2 presents properties of the solutions of linear vector matrix 
differential equations. Formal solutions of such equations are derived in 
this section. Section 6-3 gives techniques for obtaining solutions of linear 
time-invariant vector matrix differential equations. It is shown that the 


1For detailed discussions of vector matrix differential equations see, for example (6-7, 
6-2). For applications of theory of differential equations to control systems see also (6-3). 
Figures in parentheses refer to books listed at the end of this chapter. 
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solution will include the term e^'. In Section 6-4 we shall discuss methods 
for computing e^' for a constant matrix A. Section 6-5 presents transfer 
matrices and two methods for computing (sl — A)". Section 6-6 gives 
analyses of impulse responses of dynamic systems. Finally, in Section 6-7 
we present formal solutions of linear vector matrix difference equations and 
discuss discretization of linear vector matrix differential equations. 


6-2 LINEAR VECTOR MATRIX DIFFERENTIAL 
EQUATIONS 


In this section we shall obtain formal solutions of state space equations 
which are in the following form of vector matrix differential equation: 
x = A(t)x + B(r)u (6-1) 

Here, we assume that x is an n vector; u, an 7 vector; A(t), ann X n matrix; 
and B(t), ann x r matrix. We further assume that A(r) and B(r) are absolntely 
integrable as functions of f in the interval 0 < : p t «T ortu Ct t 
T=t,. Then, it can be proved that, for given t, and x(f,), there 
exists a unique solution of Eq. (6-1) over the time interval £, X 1 X f. 

In this book we shall frequently treat the case where the elements of 
A(t) and B(t) are continuous functions of time. In such a case clearly A(z) 
and B(t) are absolutely integrable as functions of time. 


Basis functions and Wronskian Consider the linear homogene- 
ous vector matrix differential equation 


x = A(r)x (6-2) 


where x is an n vector and A(t) is ann x n matrix whose elements are 
assumed to be absolutely integrable as functions of ¢ in the interval 
t, < t < t,. Assume that Xj, X», - -+ > Xn are linearly independent solutions 
of Eq. (6-2). Any other solutions can be expressed as linear combinations of 
oige 3 Xp Once specify the initial condition the solution is uniquely 
determined.) We shall see later that a necessary and sufficient condition 
that Xy Xirs o -a Xw aTe linearly independent solutions of Eq. (6-2) is that the 
determinant of the matrix consisting of column vectors X; 15 nonzero, or the 
following determinant is nonzero: 


Mig agp 079 Man 
Xo, Xa Xon 

w= i #0 
Xni Xna eet Xnn 
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This determinant is called Wronskian. Wronskian plays an important role 
in theory of ordinary differential equations. 

Entire solutions of Eq. (6-2) form an n-dimensional vector space. Any 
set of n linearly independent solutions constitutes a basis of the r-dimen- 
sional vector space. For a homogeneous scalar differential equation, any 
set of n linearly independent solutions is called a set of basis functions. 

We shall next present several theorems on the solutions of linear dif- 
ferential equations. 


Theorem 6-1 If n differentiable functions x,(t), x,(t),..., 
x,(t) are linearly dependent for the time interval t, < t < t, then the 
Wronskian W(x,, x,,...,x,)iszero for ty « t « t. 


Proof The Wronskian W(x, Xz, ..., Xn) is given by 


X; Xs eae X 
s «no & 
W(x; Xa, > 263) gm 
(n-1) (n-1) (n-1) 
X; X, ... 3 


DINCE Xu. ess 
we have 


-> X, are linearly dependent in the time interval t « t < t;, 


exi (£) + eost) + +++ + enx,(t) = 0 (6-3) 


where not all of c,, c,, ..., c, are zero. Differentiating successively Eq. 
(6-3) with respect to 7, we have . 


cyX,(£) + C(t) + +++ + cx (t) = 0 
eX) + ex) --- + cxO)2-0 
(n-1) E inan mE ey. 

CyXy(0) + 0st) + +++ + CaXa(t) = 0 


Let ¢ = t, where t, «€ t, < ¢,. Then we have the following n simultaneous 
algebraic equations in c,, Ca, .. . Cn? 


exi (t5) + CyXo(ty) abe» aie of CaXn(ty) =0 
C,X (ty) + CoXo(te) + +++ + eux) = 0 
(n-1) "x pe e ids 

€yXi(f5) + exs(t;) + +++ + euxs(t;) = 0 


(j-1) 
where the x;(t,) (i,j = 1, 2,..., n) are constants. Since not all of c,, c,,..., 
€, are zero it follows that 


OBTAINING THE SOLUTION OF THE STATE SPACE EQUATION 297 


Xt) —x(t) cc 00 Xala) 
Kilts) Halts) ee X9) 

‘ ‘ . ex % 
—- R= : én-ij 
A) i xalta) — Xn (to) 


Since t, is arbitrary within the time interval t, & t, « tf, this last equation 
implies that for t, « t « 1, 


x,(t) x(t) x(t) 

H(t) x0 cc 0 x) 
Wis Xs eX) = : : : = ij 

tis Wa = a 


This completes the proof. l 
Note that the converse of Theorem 6-1 is not necessarily true. For exam- 


ple, consider 


x(t) = t’, x(t) = |¢/ 
Then, 
t? xD 
W(X X) =| ayo 4 3y2 


Clearly x,(t) and x,(t) are linearly independent in the interval —1 « 1 < 1. 


Theorem 6-2 Suppose that the Wronskian Wi, Xa uan 
Xn) of n differentiable functions x,(t), X,(t),---»Xn(t) is zero for. the time 
interval t, < t < t,. If Wronskian of n — 1 functions » ostio. xad) 
differs from zero for the entire time interval t, « t <4, then the n functions 
x(t), xt), ===  x«(t) are linearly dependent for the time interval t, Ct <4. 


Proof By assumption W(x, Xs, ..., Xn-1) #0 for the entire 
time interval t, < t < t,. Therefore, for any ¢ between to and /, there exists 
aset of n — 1 functions of t, that is, c,, Cz, . - - , Cn-1, Such that 


eyx(t) + ext) + 7: Cn-1Xn-1(t) = Xn(t) 
eX t) +:G.%,(t) + -s* + Cua Xu 4E) = x(t) (6-4) 


(n-2) 


(n-2) (n—-2) (n-2) 
eyxy(t) + ext) +--+ + Cn-1Xn-1(t) = Xn(t) 


The c, (i= 1,2,...,n — l)are differentiable as many times as desired. 


Let us put 


(n-1) (1-1) (n-1) (1-1) 
ex, (t) + ex(t) +- + Cs Xn E x,(t) +e (6-5) 
By multiplying both sides of the foregoing n equations [Eq. (6-4) and Eq. 
(n-2) (1-1) 
(6-5)] the cofactors of Xn, X4, ..., Xn x, in the n x n determinant Wa, 
Xo +++, Xn), respectively, and summing up the resulting equations we obtain 
0 = W(x, x, ...,xX) + cW(x,, X, . 
Since W(x,, x;,...,x,) = 0 and W(x,, x... 
Thus Eq. (6-5) becomes 


esos sn) 


» Xn-i) Æ 0, we obtain c = 0. 


(-1) (n-1) (n-1) (n-1) 
eyxi(t) + ceaxt) + +--+ + Cn-\Xn-(t) = X(t) (6-6) 
To prove that ¢,, C», ..., c,., are constants, we first differentiate the 
first of Eq. (6-4) with respect to and subtract from it the second of Eq. 
(6-4) and obtain 
€,x,(t) + eyxy(t) + --- + Cra =O 
Next, differentiating the second of Eq. (6-4) and subtracting from it the 
third of Eq. (6-4), 
€,%,(t) + ext) +--+ + bpp Xn-,(t) = 0 
Similarly, 
eX (0) + jx) + +++ + €n-1¥n-,(t) = 0 


(n=?) (n-2) 2 (1-2) 
€yx(t) + e(t) + +++ + bn sXn-1(t) 


= 0 
Hence we obtain 
x(t) x,(t) Xn-y(t) €, 0 
X) X(t) Xa) es 0 
(n-5) (n-2) TN 
x(t) x(t) E E CA M CS HQ 
By assumption W(x,, X», ... , Xn-;) Æ 0. Hence 
é.=¢,=---=¢,.,=0 


which implies that c,, c,, ..., C,-; are constants. Therefore, we conclude 
that x,(r), x(t), ..., Xn(t) are linearly dependent for the time interval 
fo t < 4, because, for the time interval r, « t < t,, we have 


cixl) + ext) + +++ + Cn-1Xn-1(t) — X(t) = 0 


which clearly shows that not all of the coefficients of the x,(t) are zero. 
[Note that the coefficient of the x,(t) term is —1.] This completes the proof. 
Consider the following scalar differential equation: 


X raf to as Dx + ax = 0 


By defining 
x 
Xs 
(1-1) 
Xn= X 
we obtain 
Xi 0 1 0 tee 0 X 
Xs 0 0 1 see 0 Xi 
Xn —a,{t) —@py_,(t) —a,_,(t) —a,(t) Xn 


The solutions of this last equation form an n-dimensional vector space. We 
shall later present a theorem which implies that the necessary and sufficient 


condition that x,, X.,..., Xn where 
Xiz Xi 
Xai Xi 
x=| | |= 
(n-1) 
Xni x; 


are linearly independent and hence they are a basis of the n-dimensional 
vector space is that 


X(t) Xt) e Xn(to) 
%,(to) Xo(to) ENS Xn(to) 

W(t.) = : : : +0 
(2-1) 


(n-1) aisn 
xX,(to) Xa(to) Xn(to) 
(See Theorem 6-4. Since t, is arbitrary, if W(t.) 0, then W(t) = 0 for any 
t.) 


Theorem 6-3 Suppose that x,(t), x«(1), - - -, Xn(t) are n solu- 


tions of a linear differential equation 
(n) (n-1) F = 
L(x) = X -a(f)x +++ + Galt) + an(t)x = 0 
where the a,(1) are absolutely integrable as functions of t in the time interval 
tp <t<t,. If the Wronskian W(x,, x» ---> Xn) assumes zero at t = fo, 
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where f, < f, X t, then x,(t), x(t); ..., Xn(t) are linearly dependent in 
the time interval t, < t < t, and W(x,, xs, ..., Xn) = O in the same time 
interval. 


Proof By assumption, there exist c,, c, . . . , Cn, not all zero, 


such that 
€,Xy(t9) + coxs(t5) + +++ + CnXn(te) = 0 
C11 (f) + CoX (to) + +++ + CnXn(te) = 0 
(n-1) (n-1) (n-1) 
C,X; (tg) + esxs(t,) +--+ + CaXn(te) = 0 
This set of equations implies that the solution 
y(t) = ex (t) T exs(t) Se a CnXn(t) 
of L(x) = 0 satisfies 
. (1-1) 
Y(t) = Wt) = +++ = (t) = 0 
On the other hand, the function identically equal to zero is also a solution of 
L(x) = 0, and the coefficients of the derivatives of this function are all zero. 
By uniqueness of the solution of the linear differential equation, we conclude 
that y(t) = 0. Hence 
eyxy(t) + ext) + +++ + cxt) = 0 
for the time interval fj « t < t. Since not all c's are zero, x,(t), x.(t),.-., 
x,(t) are linearly dependent. By Theorem 6-1, W(x,, X4, ..., Xn) =O for 


ips Pe i. 


Theorem 6-4 
differential equation 


If n solutions x,(t), x,(t),..., x&(t) of a linear 


(a) (n-1) 
L(x) = x 4 ay(t)x +--+ + a4 (HX + a,(t)x = 0 
where the a,(t) are absolutely integrable as functions of ft, are linearly in- 
dependent for the time interval f, «Ct «t t;, then the Wronskian W(x,, 
Xs... X4) is nonzero in the same time interval f, « t « t. 


Proof The proof follows from Theorems 6-1 and 6-3. Details 
are left to the reader. 


Theorem 6-5 
differential equation 


Let n linearly independent solutions of a linear 


(n) (n-1) 
L(x) = x + a(t)x + --- + a,_,(t)X + a,(t)x = 0 
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where the a,(¢) are absolutely integrable as functions OFF BE 3.) xus, 
x,(t). The general solution of L(x) — 0 can be expressed as 

exi) -E ep) 4- - CnX p(t) 
where ¢,, Co; ..., Cn are constants. 


Proof By assumption, xi(t). HAL); olg a4) are linearly 
independent. By Theorem 6-4, the Wronskian W(x,, Xs, ..., Xn) is nonzero 


everywhere. 
Let an arbitrary solution of L(x) — 0 be z(t). Then for an arbitrary time 


t=, 


WX; Sig o osin FO 
Therefore, there exist constants Cı, Cs, . . . , Cn» not all zero, such that | 
EAC + CRAG) ap asp CaXn(t,) = z(t) 
ex, (t) + eo) F m WP CnXn(t) = 2(t,) 
(n-1) (n-1) (n-1) (n-1) 
ox; (f) SE esxs (tz) + fé a Cx t3) = z(t) 


«3 Gn put 


Using these constants ¢,, Cy, - - 
y(t) = Ent ei) E se E Cn Xat) 
Then 
L(y) = 0 
and 
. (n-1) (n D 
y(t,) = z(t); H(t) = 204), «+s y(t) = z(t) 
Thus y(t) and z(t) are solutions of the given differential equation satisfying 
the same initial conditions. By uniqueness of the solution of the linear 
differential equation, y(t) = z(t). Hence we have proved that an arbitrary 
solution z(t) can 5e expressed as 


AD) = 62:0) + ent ee + CnXn(f). 
Since z(t) involves n arbitrary constants, it is a general solution. This com- 
pletes the proof. | 


Fundamental matrix Let us form an n X n matrix X such 
that its n columns consist of n linearly independent solutions, or n basis 
vectors of Eq. (6-2). Such a matrix X is called a fundamental matrix. A | 
fundamental matrix is nonsingular. Any fundamental matrices satisfy the 
matrix differential equation X = A(t)X. Notice that fundamental matrices 
differ from each other by a multiplicative constant matrix. For a given 
initial condition, the fundamental matrix is uniquely determined. 


V —ÀÀ—À—— AE 


— 


e M RR € 


Let an n X n matrix c(t, te) be the solution of the following matrix 


differential equation: 


d 
a; 9€ to) = A()®(1, to), (io, t) = I 
where A(z) is the same n x n matrix defined in Eq. (6-2). Any fundamental 
matrix X(t) of Eq. (6-2) can be written as 
X(t) = ext, t))C 


where C is a nonsingular matrix. Let us denote n linearly independent column 
vectors of @(t, ty) as «b,(r, to), a(t, to), ..., o, (1 to). Then the ®;(t, te) 
define a set of basis vectors. 

Notice that 


ult, to) 
Pailt, to) 
e(t, to) = ‘ E12, save 5) 
dbnilt, ty) 
represents the response of the system with the initial condition given by 
0 
. i—1 

0 

Gf, t) =| 1 

0 

0 


The basis vectors ®,(f, £j) do not involve delta functions and are infinitely 
differentiable for t > 1. 


Note that if the «b,(r, t) (i = 1, 2,...,) constitute a set of basis vec- 
tors of a given system and if the ®,(t, t) (i= 1, 2, ..., n) constitute a set 
of basis vectors of the same system, then 

e, 1) = Blt, ()C 
where C is a nonsingular matrix. Note also that 


D(t, to) = X(t) X- (t) 


Solution of homogeneous vector matrix differential equation 
Consider next the homogeneous linear vector matrix differential equation 


(6-7) 


L 


x(t) = A(t)x(t), — x(t) = Xo (6-8) 
where x(t) is an n vector and A(t) is the same n x n matrix defined in 
Eq. (6-2). Any solution of Eq. (6-8) is given by 

x(t) — e(t, t,)x(to) (6-9) 
where ®(t, to) is the unique solution of Eq. (6-7). We can easily verify this. 
Since 

X(to) = P(to, ty) X(to) Ix, = Xo 
and 
x(t) = Ze, xlt) = OU A) 


= A)(I, to) x(t0) = A()x( 

we have verified that Eq. (6-9) is the solution of Eq. (6-8). a 

We see that Eq. (6-9), the homogeneous solution of Eq. (6—8), is simply 
a transformation of the initial condition. Hence the unique fundamental 
matrix @(f, tọ) is.also called the state transition matrix. By Eq. (6-9) it 
maps the state x(f,) (or xo) at time to into the state x(t) at time t. The state 
transition matrix contains all informations about the property of free 
motions of the system defined by Eq. (6-8). 

The unique solution x(t) observed at time ¢ after starting at time f, in 
state x, is sometimes written as P(t; Xo, t,). As time elapses ét; Xo, to) 
represents a trajectory in the n dimensional vector space. Using this notation, 
if x(4) = Xo, d(t;; Xo to) = X», and P(ts; Xo ty) = X, then we have 
lts; Xo, to) = (ts; Xo» t2) = Xs- Hence 

X, = (t5; X5; fe) = Plts, ty)X_ = P(t, fo) Plt, to)Xo 
Also 
X, = P(t3; Xs, 12) = dfi; Xo, to) = ts, fo)Xo 
Hence 
Plta, ta) (te, to) = Qs, to) 
Since /, may be any time within the interval tọ < 1; < fı, upon substituting 
t, = t, in this last equation we obtain 
Dt, ta) P(t, fo) =I 
Since t, can be arbitrary within the interval fp « t, < fı, dropping the 
subscript 2 we obtain 
Plta, t) P(t, t) = I or e- (t, to) = P(to, t) 
Example 6-1 Consider the following differential equation: 


x+2%=0 

This equation can be written as 
pb (6-10) 
Žž —2Xs 
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where x; = x(t) and x; = x(r). Rewriting in the form of the vector matrix differential 
equation, we have 


x(t) = Ax(t) (6-11) 
where 
Tag) _ [0 1 
x0 - [45]- el 


We shall assume that f, = 0. The solution of Eq. (6-10) can be obtained by use 


of an elementary technique as follows: 
X(f) —xt)- ce (6-12) 
X(t) = xY(t) = —2c,e7*4 


where c, and c, are arbitrary constants. From Eq. (6-12) it is possible to write 
ied =) 1 er? 
[aol [s] ^L] 
X(t) = exi(t) + ecixs(t) 


«o». so 3] 


In the following, we shall see that x,(t) and x;(/) are solutions of Eq. (6-11) and are 
linearly independent of each other. Since 


wth] Jl] 
e 


wil IH LE uno 


x,(t) and x,(t) are solutions of Eq. (6-11). Linear independence of xy(r) and x(t) 
may be seen from the following: if 


CoXi(t) + cxt) = 0 


or 


where 


and 


which means 
Co Hee” = 0 
—2c,e7* = 0 
then it is obvious that 
€,— c0; —0 
Thus, x,(t) and x;(/) are linearly independent. Since x,(t) and x,(t) are solutions 


of Eq. (6-11) and are linearly independent of each other, x,(t) and x(t) form a 
fundamental matrix of Eq. (6-11). A few examples of fundamental matrices of Eq. 


(6-11) are 
[i et 1 1—e* [? ert! ert 
lo —2e-2! , 0 I 2e-2t | " etc; 


Qe-2 | —2e-2 
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. Fora given initial condition, the fundamental matrix is uniquely determined. Noting 


that in the present analysis 7, = 0, we obtain the fundamental matrix «b(f, t) = 
@(t) which satisfies «p(0) = I as 


1 ¿(1 em 
lo e l 


4 »  &(t)= [* Pa 


It can be verified that fundamental matrices differ from each other by a multipli- 
cative constant matrix as follows: 


i 1—e*"] fl. e* Ii " 
j 2e? | |0 -2e*ji0 —I 

] + e l= e [p e?! ths | 
| 2e"* | Lo -—2e*]l1 —1 


—2e7?t 
I .¢% 9 rd uil E '] 
lo —2e*| Lo et 0 —2 


Hence 


90-| 


We point out an important property of ®(t). For the constant matrix 

A we have 
e(t + 7) = B(t) O(7) = Bi) D(t) 
from which 
(t) = O(7) B(t — 7) 
or 
O(t)= On) O(t—7)  (—12,...,n) 

where the ®, are the n columns of ®. Thus ®,(t) can be expressed as a 
transformation of ©,(¢ — 7). 

In the system considered in Example 6-1, @(f) is given by 


1 i(1— e^ 
anfi MI 
Hence 
[1 4(1—6 2-2) 
e E 7) m lo g e-Xt-70) | 


The foregoing property can be verified thus: 


e-[1]- B. Pez] 


= (7)® (r — 7) 


also 
a= [2 ]-[ 


= O(7)@,(t — 7) 


£0 — 4 las = i 


ev e Xt-7) 


If A(t) is a constant matrix, then ®(r, t) can be obtained as an expo- 
nential function of Ar. (Refer to Section 6-3.) This is not, however, the case 
for the time-varying matrix A(t), since exp [7 A(7) dr is not necessarily the 
fundamental matrix ®(t, t) unless A(t) and [i A(r)dr commute. Namely, 


t if A(z) and f x A(7)dc 
e te) = =e j to Ayer Fh not commute for all » 
and 


(t, to) = exp f A(r)dr (i A(t) anc fi, "rand 


commute for all 7 

Notice that if A is a constant matrix or if A(t) is a diagonal matrix, then 
clearly A(t) and f; A(7) dr commute. For a time-varying matrix A(t) there 
is no convenient formula for computing SP(/, tə) and, thererfore, except for 
simple cases we frequently have to resort to numerical integrations for 
computing (f, tọ) in time-varying systems. 


Solution of nonhomogeneous vector matrix differential equation 
Consider the following vector matrix differential equation: 


x(t) = A(r)x(t) + B(r)u(r), (6-13) 


where x(r) is an n vector; u(t), an r vector; A(t), an n x n matrix; and 
B(1), an n X r matrix. Here we assume that the elements of A(t) and those 
of B(t) are absolutely integrable as functions of £ in the interval f DE. 
Let @(t, t) be the unique matrix which satisfies Eq. (6—7), rewritten as 
follows: 


x(%) — X; 


TOU= ADDU), — Q5) —1 


Assume that 
x(t) = ext, t) y(t) 


then 
x = (oy) = oy + by = y + Ady = Ay + Bu 
Therefore, 
oy = Bu 
or 
t 
y(t) = f, D'C, t.)B(r) u(r) dr + y(t) 
Since 


Y(to) = D- (to, fy) X(to) = X(t) = Xo 


——— a e — 


—— M — 


the solution of Eq. (6-13) is given by 
x(t) = ext, to) y(t) 


= Bt, nx, + 5) f OG 5)BG)wr) dr (6-14) 


This solution can also be written as 
t 
x(t) = dt, f,)X> + [ , (t, 7) Br) u(r) d7 


Notice that the solution of the vector matrix differential equation (6-13) is 
made up of the homogeneous solution (f, ¢,)x,, and a particular solution 
x,(t) given by 


xt) = (t, 4) f^ 7t, t) B@)u(r) d7 


=f D(t, 7) B(z)u(7) dz 


6-3 SOLUTIONS OF LINEAR TIME-INVARIANT 


VECTOR MATRIX DIFFERENTIAL EQUATIONS 


Although most physical systems are more or less time-varying, many of them 
may be represented approximately by linear time-invariant equations. This 
approximation is good if characteristics of the system change very slowly 
compared with the input variations. In this section we shall give a particular 
attention to linear time-invariant differential equation systems. The general 
method for finding solutions of linear vector matrix differential equations 
presented in Section 6-2 can be simplified for time-invariant systems. 

Notice that in time-invariant systems the response does not depend on 
tə the time the input is applied. Hence we can conveniently choose /, as the 
origin of the time axis and consider that /, equals 0 in order to simplify 
presentation of the analysis of linear time-invariant systems. We shall 
therefore put t, = 0 in the following analyses, unless otherwise stated. 


Homogeneous equation (free dynamic system) Consider the 


free motion of the dynamic system described by 
x(t) = Ax(t), x(0) = x, (6-15) 


where x(t) is an n vector which defines the state of the system at time / and A 
is an n X n constant matrix. The solution of this vector matrix differential 
equation can be obtained by various methods. One method was presented 
in Section 3-8. We shall first review this method (which we shali call Method 
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1) and shall present two other methods (Method 2 and Method 3) in what 
follows: 
Method I In Section 3-8, the system equation, Eq. (6-15), was 
transformed by the transformation x — Py into 
y = P-'APy 
If A has distinct eigenvectors, then P-!AP can be made equal to D = diag (A,, 
Xo, . .. , Àn) and the solution is found as 


et 0 
et 
y(t) = i y(0) = Q(t) y(0) 
0 e^t 
Since 
e^t 0 
gist 
Q(t) = . ‘ = edt 
0 et 
we obtain 
y(t) = eP'y(0) 
or 
x(t) = PQ(r)P-'x(0) 
= Pe?! P-!x(0) 
= Pe(P-'AP) P-1x(0) l (6-16) 
In the case where A involves multiple eigenvalues X; of multiplicity m, (i = 1, 
dua kim, + my, +... +m = n) and the eigenvectors corresponding to Ai 


are also multiple with the corresponding multiplicity m, then P-!AP becomes 
the Jordan canonical form, which we shall denote by J. The solution x(r) in this 
case is given by 


x(t) = PS(r)Q(r)P-'x(0) 


= Pes‘ P-1x(0) 
= Pe(P AP) p-1ix(0) (6-17) 
where S(t) is given by 
S(t) 0 
St) 
S(t) = ; 
0 S (t) 
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where 
t? pmol 
LI NOU (m-D 
tpm- 
o X. o£ s | 
S(t) = . = m, xm, matrix 
00 0 cs t 
00 0 «=: 1 


[Refer to Section 3-8 for derivation of S,(t).] 
Method 2 Another way to find the solution of Eq. (6-15) is to 
proceed as follows: By analogy with the scalar case 
E(t) = ak), EO) = E 
whose solution is E(t) = e% £, we use the matrix exponential 
co A*t* 
At J 
e + P Ei 


for finding the solution of Eq. (6-15). The series e^ converges absolutely and 
uniformly in any finite interval of the time axis. Since 


d (eA) = Ae^ 


the solution of Eq. (6-15) satisfying the initial condition x(rj) = x, is given by 

x(t) = eAtx, (6-18) 
For t = 0, e^ reduces to the identity matrix. Therefore, Eq. (6-18) clearly satisfies 
the initial condition. 

In this approach we must compute e^ for a given A. Computation of e^ can 
be accomplished by a few different ways. (Four such methods are presented in 
Section 6-4.) 

Method 3 The third method is based on the analog computer simula- 
tion for solving the vector matrix differential equation. Essentially the simulation 
consists of n integrators. As an example, consider the following system: 


x = Ax 
where 
xi Qj A Qiz 
X-—|X |, A =| an an az 
Xs G3, Q32 3; 


Figure 6-1 shows an analog computer diagram for solving this third-order system. 
In order to simplify the essential part of the diagram, we made the convenient 
assumption that the a;; are negative and |a;;| «& 1. This assumption is, of course, 
inessential and the diagram can be modified easily to the general case where the 
aj; assume any real values. p 


Figure 6-1 Analog computer diagram for solving x = Ax. 


A few comments on ®(t)' From Eq. (6-16), Eq. (6-17), and 
Eq. (6-18) we note that the unique fundamental matrix c(t) which satisfies 


$(r) = Aó(, (0) —I 
is given by 
P(r) = e^ 
=Pe™P-' — (or Pel! P-') 
= Pe(P'AP)tp-1 (6-19) 


Notice that since 
e^t — pe PAP: p-: 
‘For time-invariant (or constant coefficient) systems ®(ż, to) = e(t — tə, 0). Hence for 


such systems we usually use the simplified notation ®(t — to) or’ ®(r) if ty = 0, rather 
than ®(z, to) or (t — to, 0). 


we have 
P-!e^ p — e(C7APX 
Notice also that, for an n x n constant matrix A, 
e(t + 7) = e^t = Dt) @(7) = e(7)e(r) t,T20 

and 

Q-!(f) = e^ = ó(—r) 
Hence the solution of 

X = Ax; X(t) — X; 
can be written as 

x(t) = e^t- x(t) = e(t — t)x(t,) 

From Eq. (6-19) we see that the elements of @(t) consist of terms like 
te (5—0,1,2,..., m, — 1, where m; is the multiplicity of the eigenvector 
corresponding to the eigenvalue A). Specifically, if the eigenvectors of A are 
distinct, elements of ®(r) consist of e* (i = 1, 2,..., n). If the eigenvectors 
of A involve, for.example, one multiple eigenvector corresponding to the 
eigenvalue A,, and n — m distinct eigenvectors corresponding to Am+p-- -3 
An then the elements of ®(t) will involve the terms /^e^* (h = 0, 1,2, ..., 
m — 1) and e** (i 2 m +1, ..., n). Hence the elements of @(t) can be 
differentiated as many times as we wish for t > 0. 


Nonhomogeneous equation (forced dynamic system) Consider 
the following vector matrix differential equation: 
x(t) = Ax(r) + Bu(t), x(0) = x, (6-20) 


where x is an n vector, u is an r vector, Ais an n X n constant matrix, and B 
isann X rconstant matrix. By writing Eq. (6-20) as 


x(t) — Ax(t) = Bu(t) 
and multiplying both sides of this equation by e^^', we obtain 
e-^'[(r) — Ax(t)] = A [e-^*x(r) = e-^'Bu(r) 
Integrating the preceding equation between 0 and t we get 
e7*tx(1) — Xy+ f e-^*Bu(7) dr 
or 
x(t) = ex, + f e^t-?Bu(z) d7 (6-21) 


This is the solution to Eq. (6-20). Of course, the same solution can be 
obtained from Eq. (6-14). To demonstrate this, rewrite Eq. (6-14) as follows: 


x(t) = (x, + DC) f (But) d (6-22) 
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where ®(z) is given as 
M(t) = e^ 
Since 
D(t)D"'(r) = (t — 7) = e^v-?» 
Eq. (6-22) can be simplified into 


x(t) = e(r)x, + f D(t — 7)Bu(7) dr 
= ey, + f e^t-?Bu(7) dr 


which is the same as Eq. (6-21). 
Notice that if the initial time is given as z, instead of 0, then we have 


x(t) = e&Mt-t&x, + [^ eht-PBu(z) dr 
to 


where X, = X(f) 


The response x(t) is a sum of the motions due to the initial condition and 
those due to the forcing function. The motions due to the forcing function 


Figure 6-2 Analog computer diagram for solving X = Ax + Bu. 
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depend on B. It is possible to choose B such that any particular motion 
eè! cannot be excited for any input. This particular motion can be excited 
only by the initial condition. (Refer to Example 6-3.) 

Next we consider an analog computer simulation of the nonhomogeneous 
vector matrix differential equation. Consider the following system: 


X = Ax + Bu 


x= | * a= ("| a= |* al B= (> “a 

Xs Us Ay, Az bs b. 

The solution of this differential equation can be obtained by the analog 
computer simulation shown in Fig. 6-2. In the diagram we assumed that the 
values of the a,, are negative and | a,;| < 1; the values of the b,; are positive 
and |5,;| < 1. Such assumptions, however, are inéssential and the diagram 
can be easily modified to handle the general case where the a,; and the b;; 
assume any real values. 


Example 6-2 Obtain the solution of the following vector matrix 
differential equation where u is a unit step function occurring at t = 0: 


ad —6 1 Qx 2 
X,|2|—11 O 1|[|xe|-| 6 | [u] (6-23) 
n" —6 0 OJLx; 2 
or 
X = Ax + Bu 
Referring to Example 4-2, we find that by the following transformation 
x = Py 
or 
x; —1 2 1 Yı 
X |=| —5 8 3|| » 
xg —6 6 2JL» 
Eq. (6-23) can be transformed into 
Jy -1 0 Oll» 1 
p = 0 —2 O}] y2 |+] 1 | (4) 
L5 0 0 —3) L» 1 
or 
y = P-!'APy + PB (6-24) 


The solution of Eq. (6-24) is obtained as follows: 
y(t) = eF"^Pty, + fjePrAPe-PP-1Budr 
0 


or 
»0)| [et 0 oOoo[x»(0) 1 — e^t 
WA SE 9. its ua) | y0) | +] $(1 — e=?) 
wt LO o ely} L10-—e) : 


In terms of the original state vector x(t), the solution is given by 


X(t) = Pe" APip-ix, + P [erara-np-iBudz 


or 
x(t) =f -2 Fipe 6 0 —i à —i][x(0) 
X(t) |=| —5 8 3 0 e o —2 1 —}]| x,(0) 
x(t) —6 6 2 0 0 el, $ = $] Lxsy(0) 
ae 1 l— e~ 
+| —5 8 3||1(1—e-2) 
—6 6 2JL} (1 — e- 
Example 6-3 In this example we demonstrate that in the system 
X = Ax + Bu, x(0) = x, 


if B assumes a certain form, then a particular motion eè! cannot be excited for any 
input. Such a particular motion can be excited only by the initial condition. 
Consider the system described by 


s d 
(p + 1Xp + (p +3)x=(p+1)u, p= pi 
Referring to Example 4-4, the simplified state space equation of this system is 
Vy —1 0 OEA 0 
»|| 9-2 Ol|l»|-c|1 | Ld] 
Vs L- © 0 —3) Lys 1 
The solution of this last equation is given by 
»] fet 9 O77y@] ,[ o 
yo|=| O e* 0 y0) | + e-*t-7 | u(r) dr 


Ys 0 © e-*] Ly,(0). ? | g-3u-n) 


Clearly, the forcing function u is uncoupled to the motion e-t. (The first com- 
ponent of the control matrix in the simplified state Space representation is zero.) 
Therefore, this motion can be excited only by the nonzero initial condition y (0). 
This situation occurs whenever factors in the numerator and denominator of the 
transfer function cancel each other. (In the present system the terms (p + 1) in 


both sides of the equation cancel each other.) We shall discuss such cases again 
in Section 7-6. 


Example 6-4 Consider the system described by 
x(t) = Ax(t) + Bu(r), x(0—) = x, 


t 


where x = n vector ‘ (state vector) 
u = r vector (control vector or input vector) 
A =n X n matrix 
B =n X r matrix 
Obtain the response to each of the following three inputs: i 
1. The r components of u are impulse functions of various magnitudes. 
2. The r components of u are step functions of various magnitudes. 
3. The r components of u are ramp functions of various magnitudes. 
Assume that each input is given at ¢ = 0. 
Referring to Eq. (6-21) the solution x(t) can be written as follows: 


x(t) = ex, + [^ e^t-PBu(z) d7 
0- 
(1) Impulse response: Let us put the impulse input u(r) as 
u(t) = S(t) w 


where w is a vector whose components are the magnitudes of r impulse functions 
applied at t = 0. The solution-to the impulse input 8(r) w given at t = 0 is 


x(t) = e^, + f’ eAt-PB3(r) wdr 
= eAlx, + eA Bw 
(2) Step response: Let us put the step input u(r) as 
u(t) =k 


where k is a vector whose components are the magnitudes of r step functions 
applied at t = 0. The solution to the step input applied at ż = 0 is given by 


x(t) = e^'x, + ja eAC BK dr 
0 
= eAtx, + A7!(e^' — I)Bk 
(3) Ramp response: Let us put the ramp input u(f) as 
u(t) — tv 


where v is a vector whose components are the magnitudes of ramp functions given 
at t = 0. The solution to the ramp input tv given at ¢ = 0 is obtained as 


x(t) = e^'x, + a eAt- Bry dr 
0 


= eAtx, + [A*(e^' — Y) — A-!:] Bv 


6-4 METHODS FOR COMPUTING e^! 


The solution of the linear time-invariant vector matrix differential equation 
s ; J 

involves the matrix exponential, e^'. It is necessary to express e^' as a square 

matrix in order to obtain numerical solutions of x(t). 


1This section follows (6-4, 6-5). Figures in parentheses refer to references listed at the 
end of this chapter. 
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There are several methods available for computing e^'. If the number of 
rows of the square matrix becomes five or larger, hand computation 
becomes exceedingly tedious and machine computation may become neces- 
sary. (There are standard computational schemes of e^' for machine com- 
putation.) We now present the following four methods: 


Method 1 Transform A into the Jordan canonical form. If 
the eigenvectors of A are distinct, then referring to Eq. (6-19), we find that 


e^ € Pe? "'^Pxtp-1 = PeP?!P-! 


(Refer also to Theorem 5-5.) Once A is given and the eigenvalues are deter- 
mined, then a necessary transformation matrix P can be obtained by the 
methods presented in Chapter 3. e^' can then be obtained as a square matrix 
as follows: 


et 0 


et 
e^ = Pe™P-! =P DNE © (6-25) 


0 ' et 
Equation (6-25) applies only to the case where A has distinct eigenvectors. 


If A involves multiple eigenvectors, then e^' can be written in a slightly 
complicated form. (Refer to Section 3-8.) 


Method 2 Expand e“ into a power series in t. (The matrix 
exponential e^ is uniformly convergent for finite values of 1.) Add the 
corresponding elements in the matrix terms of the infinite series. This method 
was discussed in Section 5-2. Except for very simple matrices A this method 
is not quite efficient for hand computation. 

Since e^' is given by 


eec ien QD OR e Mrs) e 


note that each term in parentheses is equal to the entire preceding term. 
This provides a convenient recursion scheme. If a computer is available, this 
method may be used. The computation is carried out to only enough terms 
so that additional terms are negligible by comparison with the partial sum to 
that point. 

In obtaining e^' by the use of a digital computer the virtues of the series 
expansion technique are its simplicity and ease in programming. It is not 
necessary to find the eigenvalues of A. There is, however, some computational 
disadvantage to the series expansion method. This disadvantage stems from 
convergence requirements for the series e^. The running time for a simula- 


MOORE TL E MET UIMUE EUCH I e TET UTI a ES EE CUT TON TEA SETS DTAP Y AaS 
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tion will be the longest among other schemes for computing e^. Use of the 
Jordan canonical form requires considerably more programming but will 
run in a fraction of time needed for the series solution. 

The series expansion of e^' fails to provide the insight into the system 
behavior afforded by the Jordan transformation, because the characteristic 
responses of different modes are not separated. 


Method 3 Use the Laplace transform method. Taking the 
Laplace transform of the following equation: 


X Ax, x0) = X; (6-26) 
we obtain 
sX(s) — x, — AX(s) 
or 
(sl — A)X(s) = x, 
from which 
X(s) = (sI — A)"!x, (6-27) 
Since the solution of Eq. (6-26) is given by 
Xm My, 
we obtain 
X(s) = .Z[e^']x, (6-28) 


Comparing Eq. (6-27) and Eq. (6-28), we see that for t > 0 

Le] = GI — A)" 
Thus, to obtain e^t, first invert the matrix (sI — A). This results in a matrix 
whose elements are rational functions of s. Then, take the inverse Laplace 
transform of each element of the matrix. (sl — A)" can be obtained by 
computing the cofactors. This, however, may not be very efficient. In Section 
6-5 we shall present two other methods for computing (sI — A)"'. 


Method 4 Use Sylvester's interpolation formula. We shall 
first consider the case where the roots of the minimal polynomial $(X) of 
Aare distinct. Then we shall deal with the case of multiple roots. 


Case 1: where the minimal polynomial of A involves only distinct roots We 
shall assume that the degree of the minimal polynomial of A is m. By substi- 
tuting f(A) = e^' into Eq. (5-8) we obtain 


| xp e appe pt 
| X Xp ose AT e*t 
. eae : 5 : =0 (6-29) 
Ls hes NES 
I A A? o A" eM 


By solving Eq. (6-29) for e^t, e^t can be obtained in terms of the A‘ (k = 0, 1, 

2,...,m — 1) and the e* (i = 1, 2, ... , m). [Equation (6-29) may be ex- 

panded, for example, about the last column.] 
Solving Eq. (6-29) for e^' is the same as writing 


e^! — aul + QA + aA? + -e H au ATO! 


and determining the a, (k = 0, 1, 2, ..., m — 1) by solving the following 
set of m equations for the o: 

Ay + AA, + QAP F +++ + OS ATO = et 

Qs + Ayre + AA H --- + GANT = ed! 


Oy + Ga + G3AÀ, + +e) + OGNI = en 


If Ais an n X n matrix and has distinct eigenvalues, then the number of the 
a, to be determined is m = n. If A involves multiple eigenvalues but its 


minimal polynomial has only simple roots, however, then the number of the 
a, to be determined is m < n. i 


Case 2: where the minimal polynomial of A involves multiple roots 
Suppose that the minimal polynomial of A involves three equal roots. Then, 
by substituting 

F(A) = e^ 
into Eq. (5-18) we obtain 


66 L 39v (m — Des =- 2) p Des 

© Y X oY (m — gp? ret 

ly M M -:- a et 

l Dy AS AP ríe AT ps [79 — (630 
loh AA AM es Ax e 

| a: on. A í|ee Am-! e^ 


Equation (6-30) can be solved for e^t. [Equation (6-30) may be expanded, 
for example, about the last column.] Just like Case 1, solving Eq. (6-30) for 
e^! is the same as writing e^ as 


e = al + QA + QA? + +++ + ou ArT! 
and determining the a, (k = 0,1,2,...,m — 1) from 


y+ Sas + o Qn Dn Du, aps = Te 


a, + 2a.r, + 3a4NXl--- + (m — l)a, AT? = tet 


| 


L 


Qo + Ar, + Apt eee + Qu P = eu 
Qs + MA, + ari Ss Amir) = et 


i — das 
Oy + On + OX E o ORO = En 


The extensions to other cases where, for example, there are two or more sets 
of multiple roots will be apparent. Note that if the minimal polynomial of A 
is not found, we may substitute the characteristic polynomial for the minimal 
polynomial. Amount of computations may, of course, be increased. 
Example 6-5 Compute e^' where 
0 17 
A= 
[o -2l 
We shall illustrate computation of e^t by use of the four methods presented 
earlier. 


Method 1- The eigenvalues of A are 0 and —2. Hence, the necessary 
transformation matrix P is given as 


Therefore from Eq. (6-25) e^ is given as 


"Ai J e Ale | 
* =h -2]lo ello — 


= [a zü-— i 
0 gH 


Method 2 ^ ë^! may be expanded as series of matrices and then added 
together into a closed form as follows: 


Nie Me 
oJ 


2t? Qty 
l i-i-:«9 EE x ) 
D 2 3 (ry 
0 1 — 2:4. GP Qm GY... 
1 40 —e3) 
-lo gu 


Method 3 Since 


azai J-i -alo ies] 
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(sI — A)! is obtained as 


GI — A)" Ss et) 
0 1 
s+2 
Then we have 
id. 1 
hehe] a — Ae s s(s4-2) 
9. 2s 
$42 
or 
al | 1 
Laj] Z- 1(] — -it 
— E [57] " uocem 
ep eft] e £e 


Note that if (sI — A) is of complicated form, either of the two methods for com- 
puting (sI — A)! presented in Section 6-5 may be better in saving time for compu- 
tation. 


Method 4 Equation (6-29) becomes 


E X, e 
1 A. e*[-0 
I A e^ 
Substituting A, = 0, A, = —2 into this last equation we obtain 
1 CEIR | 
1 —2 e-*/=0 
I A e^t 


Expanding the determinant, we obtain 


—2e^ 4A +21 — Aet — 9. 
or 
e^ 


(A + 2I — Ae~*) 
MALE ee 


= r 


The alternate approach is to solve the following equations for a, and a: 
Qs + GÀ, = et 
Qs + aX, = eht 
Since A, = 0, A, = —2, Eq. (6-31) becomes as follows: 
Q1 
Qo — 2a, = e*t 


I 


(6-31) 
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from which 
Qs — 1, a, = 4(1— e7*) 
Thus, e^ can be written as 
e^ = a + aA — Id (1 — e79)A 
1 à1—e7*) 
= F e 


Once e^t is computed, the solution to the following vector matrix differential 
equation, 
[edo ndi] 
LS] “Lo i 
x(t) = e^'x(0) 


[d] D H Y1— th Dod 
x(t) 0 grt x,(0) 
EN [5 + 4x,(0) — A PR 
e~*x,(0) 
Example 6-6 Matrix Riccati equation (6-6) Consider the matrix 
differential equation 
X = A(OX + XB(t) + XC(0X + DC) 
where X is an n x n matrix and A(t), B(t), C(t), and D(t) are also n X n matrices. 
It is assumed that A(r), B(t), C(t), and D(¢) are absolutely integrable as functions 
of t in the interval t, < t < t,. This equation is called the matrix Riccati equation. 
The matrix Riccati equation arises frequently in the solution of optimal control 
problems. We shall show that the matrix Riccati equation can be solved by using 
two simultaneous linear matrix differential equations. 
Let us define two simultaneous matrix differential equations, 
Y = —B(0X — C(0Z 
Z = D()Y + A(0Z 
If in the interval << t < 4, the n X n matrix Y is found to be nonsingular, then 
the solution to the matrix Riccati equation can be given by 


can be obtained as follows: 


or 


(6-32) 


X(t) = Z(t)Y (0 t, « t «t 
This can be easily verified. Since 
X-ZY---ZY-s and  Y-!- -Y-YY- 


we obtain 
X = [DY + A()Z]Y^ + Z(-Y-)(-B(0Y — C()ZIY^! 
= D(t) + A(t)X + XB(r) + XC(óX 
If D(t) is zero, then Z may be solved independently of Y. Once Z is obtained, 
Y may be solved. In general, however, if coefficient matrices in the matrix 


s, 


me e ee amens tet 
m MAREy M rmn 


pero 


Riccati equation are time-varying, solving Eq. (6-32) for Y and Z is not a simple 
and D(z) are constant matrices, then it is easy 


matter. If, however, A(r), B(r), C(t), 
to obtain at least a formal solution for Y(t) and Z(r). 


If matrices A, B, C, and D are constant matrices, choose r, = 0 and define a 


2n X n matrix W(r) and 2n x 2n matrix G as follows: 
Y(t) —-B;-C 
a ec] 
Z(t) 
Then Eq. (6-32) can be represented by 
W(t) = GWG) 
whose solution is 
W(t) = e&W0) 


Once e“ can be obtained as a square matrix, then Y(t) and Z(t) can be determined 


immediately. If Y(1) is nonsingular, then the solution X(t) to the time-invariant 
matrix Riccati equation is given by 


X(t) = Z(t)Y-(t) 


6-5 TRANSFER MATRICES 


In this section, we present s plane solution of the linear time-invariant vector 

matrix differential equation. We then define the transfer matrix. Finally, 

we present two methods for computing (sI — A)-!, 
Consider the system described by 


X = Ax + Bu (6-33) 


where X — n vector (state vector) 


u = r vector (control vector) 
A — n X n constant matrix 
B — n X r constant matrix 


The Laplace transform of Eq. (6-33) with zero initial condition is 
5X(s) = AX(s) + BU(s) 
from which 
X(s) = (sI — A)-'BU(s) (6-34) 


The poles of this equation are the values of 5 for which the determinant is 
zero, namely, 


|sI— A|=0 


This is the equation determining the eigenvalues of the matrix A. Hence the 
poles of the system are the eigenvalues of A. 


a 


Transfer matrix Consider the system shown in Fig. 6-3. 
From the block diagram 
X\(s) = H,,(s) Ui(s) + His) U2(s) 


(6-35) 
X,(S) = H2,(s)U,(s) + Hass) Us) 


Figure 6-3 Multiple-input multiple-output system. 


In such a multiple-input multiple-output system, the transfer functions 
constitute a transfer matrix. Rewriting Eq. (6-35) in terms of vector matrix 
notation 


X) _ [A,,(s) ax] LE 
[xo] " Laat) H,,(s)| LU.G) 


In this system the transfer matrix consists of four components fs is yas 
H,,, and H,,. In the transfer matrix the (i, j)th element is the transfer func- 
tion between the ith output and jth input. It is the Laplace transform of the 
unit impulse response, the impulse being applied at the jth input and response 
being taken at the ith output. 

A state space equation for such a system defined in terms of the transfer 
matrix can be obtained easily if the transfer matrix Hs) relating the state 
vector x and the control vector u is a square matrix and is nonsingular. 

As an example consider the following system 

1 1 
X,(s) 5 sot 2) U,(s) 
2 "» 
s+2 


(6-36) 


X,(s) 0 U,(s) 
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Let us find a state space equation of this system in the following standard 
form: 


X = Ax + Bu 
From Eq. (6-34) and Eq. (6-36) 
1 1 
(sl — A)"B = s s(s : 2) 
" 54-2 
Hence 
ee es ee 
B-1(sI — A) = sS ii -- 
0 313 0 s+2 


‘to 7b -a 


Comparing the coefficients of the s terms and constant terins on both sides 
of this last equation we obtain 


Ea ok 


Hence the state space equation becomes 
" 1 O0][1u, 
0 lü 


X 0 1 

al- lo -2 

The eigenvalues of A are 0 and —2. These eigenvalues are clearly recognized 
in the transfer matrix. Xd nd 

One way to obtain a state space equation of the system given by X(s) 

— H(s) U(s) in which the transfer matrix H(s) is not a nonsingular matrix is 

to draw an analog computer diagram for simulation of the system. Then 

the output signals from the integrators are a set of state variables for the 

system. The state space equation for such a case can be written directly from 

the analog computer diagram. 


Xi 
Xs 


Computation of (s1 — A)! If we wish to determine the trans- 
fer matrix from Eq. (6-33), we must compute (sI — A)^!. Also, in computing 
e^ by the Laplace transform method we need computing (sI — A)^'. Com- 
puting (sl — A)~! is, except in very simple cases, generally a time-consuming 
task. The simplest but time-consuming way to obtain the n X n matrix 
(sl — A)-! is to compute cofactors. This is not efficient if n > 3. There are 
a few other methods for computing (sI — A)~!. We present two such methods 
in what follows: 
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Method 1! Let us put 
P = (1 —.A)"' 


then 
sP = AP +I 


By premultiplying (sI + A) to both sides of this equation we obtain 
sP — A?P A 4 sI 
Similarly, by premultiplying (sI + A) to both sides of this last equation, we obtain 
SP = AP + A? + 5A 4 S 
By repeating this process we obtain the following set of equations 
PP 

SP—AP-I 

SP — A?P -4- A — sI 

SP = ASP + A? + så + s 


sPP = APP + A?-! + sAP-? + oe) + POA + sr] 
where p is the degree of the minimal polynomial of A. Let the minimal polynomial 


of A be 
al +A + aA? HH +a,A? = 0 


where a; = 1. Then 


p 
: as P = X a, AIP. + > aA + s $, a AT? 
ito io i iti 


p 
4o. GR? Dy At Pal 
isp-l 


= * s £ q;Ati=i 
j=0 i=l+j 
Therefore, 
p-l p 
S > oA 
(SI — A)! =P == (6-37) 
mons 


If the minimal polynomial and characteristic polynomial of A are identical, then 
p =n. If p =n, then Eq. (6-37) becomes 


fees s 
I A)! — j=0 i=l+j 
(I — Ay = A 
where jsl —A] = z asi, i1 
(s 


Method 2 (6-4) The second method is based on Sylvester's inter- 
polation formula given by Eq. (5-6). We first consider the case where the minimal 
polynomial of A has only distinct roots and then consider the case of multiple roots. 


1Discussion of Method 1 follows closely (6-7). 


t 


Case 1: where the minimal polynomial of A involves only distinct roots Let us put 
f(A) = GI — A) 
The function f(A) can be written as 


p 
f(A) = È SSZ (6-39) 
where p is the degree of the minimal polynomial of A, and the Z,aren X n matrices. 
Equation (6—39) can be written as 


— Avin. Ze 
(sI — A) Pes .. (6-40) 
To determine the Z}, we substitute 
gs) = (s — si (Ce ee eee). 


where s, is any one of the p roots of the minimal polynomial of A, into the following 
p simultaneous equations and solve for the Z,: 


&(A) = 252 + £52. + +++ -gG)Z,  (G-—12,...,p) (6-41) 


Note that g,(A) = (A — s,T)'-!. The p simultaneous equations possess a solution, 
since 


ECS) mc) - eG) 
B(S) Sols.) +++ &2(Sp) 


£p(5;) &y(52) erue &y(55) 


(5,— s,)?7'* ($4 Sy)?! (Sp — Sy)?" 


As an example, consider the following 2 x 2 matrix A: 


A [o al 
0 -2 
The eigenvalues of A are s, = 0 and s, = —2. Hence, 
aa. Ze 
(sI — A)! = = Ts 


To determine Z, and Z., put 
&(s)-—(s—s)-  (i—1,2) 
(We can, of course, take s, as sx.) Then Eq. (6-41) becomes as follows: 
gA) = 81(5))Z, + gi(:)Z. 


(A) = 82(51)Zy + 80(S2)Zo 
or 


I=Z, + Z, 
A+ 2I =2Z, 


Hence, 
; iod 
i EU TP gum lo 0 
ER 
| z-I-z--i-[) 1 
it follows that 1 1 
A 2. s s(s +2) 
| 


pi Ardea " 1 
s+2 

Case 2: where the minimal polynomial of A involves multiple roots In this case f(A) 
can be written as 


my-1) 
f) - $y Cdn En P f Zen] 
k=l 


where m, = the multiplicity of the factor (s — 54) in the minimal polynomial of A 
q = the number of distinct roots Si, Se.» -Sq (g < p) of the minimal 
polynomial of A u 
p=m, +m, +--+ + mq = the degree of the minimal polynomial of A 


In the present case 
Sf (sx) = (S — s) 
JSk) = (s — Se)? 
(mn, -1) ae 
f (se) = (m, — DYs — s)" 
where the derivatives are with respect to s. Hence we have 
, (re—D' Pans | (6-42) 


ET Zi Zo M 
(sI =A = Xe Ss Sa + (s — Sx)? * E (s — s.)"* 


To determine the Z,;, we substitute 
g()-(s—s)"* G-2142,....p) 
where s, is any one of the roots of the minimal polynomial of A, into the following 
p simultaneous equations and solve for the Z,;. a= 
gA) = g(5)Zu + Bi) Zi2 = E © + 8: (50Zam, 
+ og(Z£n tc "E (Sq)Zame (6-43) 
(i= 1,2,....,p) 

where the derivatives are with respect to s. Note that a solution to Eq. (6-43) 
exists, since the following matrix of coefficients is nonsingular: 
teg ^h (Sy) FAC MEL "n Go 


(me-1) 


SES Ta (Si) gis) +++ Be (Sq) 


gs) s) 
gS) x5) 


B 


gs) sex5) 


(m«-1) 


Sa) dum) ce Bo 
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Example 6-7 (6-2) Consider a system described by 


X = Ax + Bu 
y = Cx + Du (6-44) 
where = n vector (state vector) 


r vector (control vector) 
m vector (output vector) 
n X nconstant matrix 

n X r constant matrix 

C — m X n constant matrix 

D — m x r constant matrix? 


x 
u 
y 
A 
B 


Let us determine the transfer matrices for this system. The Laplace transforms of 
Eq. (6-44) with the zero initial condition become 
sX(s) = AX(s) + BU(s) 
Y(s) = CX(s) + DU(s) 
from which 
X(s) = (sI — A)“ BU(s) 
Y(s) = [CGI — A)?B + DIUG) 
The transfer matrix between X(s) and U(s) is (sI — A)-!B and that between Y(s) 


and U(s) is [C(sI — A)-'B + DJ]. The partial-fraction expansion of X(s) can be ob- 
tained by use of Eq. (6-42). The partial-fraction expansion of Y(s) is given by 


4 CI Ze Zi (m, — t)! Zim, 
Yi) = {CS [tpat CQ emIB--DLUG) (6-45) 


Let us compute the transfer matrix between X(s) and U(s) and that between 
Y(s) and U(s) for the following A, B, C, and D: 


—5 —4 2 -1 0] 

A= 3 3 —2]| B= LJ 
9 2 -2 0 2 
C=[1 1 0, D=[0 0] 


In this system the roots of the minimal polynomial of A are — 1, —1, —2. 
To obtain the Z,; put 


&()-—1, gls)=s+1,  gy(s) =(s + 1)? 

Then we obtain the following three equations for the determination of the Zi: 
gA) = £(5)Z5 + Eil) Zie + gi(52) Zo 
8AA) = 8:(S)Zn + 8551) Zio + 2:52) Zo 
EA) = gx(8)Z + giS) Zi + Ex(52) Za 


(6-46) 


Substituting 


&(A-7L  £$(A)-—A-L (A) =(A +T} 


OBTAINING THE SOLUTION OF THE STATE SPACE EQUATION 329 


d 
an Ss = —1, So = —. 
into Eq. (6-46) we obtain 
I=Z,, + Zn 
A +I = Ze — Zn 
(A Ex I)? = Za 
a —3 —4 2 0 0 0 4 4 —2 
Zi- 0 1 91. Zi, = 3 4 —2|, Za = 0 0 0 
—-6 -6 4 6 8 —4 6 8 3 
(sl — A)^! is then obtained as 
"i Zu + Zio Za 
ae hak ee ee 


and X(s) is computed as 
X(s) = (sl — A) 'BU(s) 


=j 4 
s+1 (s + 1)(s + 2) | | Ui(s) 
“| G+ 1? G1» 
2 As — Ds + 3) | | Us(s) 
GFI  (G-rDXs-2) 


The transfer matrix between X(s) and U(s) is given by this 3 x 2 vam ps 
Eq. (6-47), x(t), x(t), and x,(t) for any inputs u, and u, can be compute easi : 
The Laplace transform of the over-all response from the two inputs to the single 


output is given by 
Y(s) = [C(sI — A)*!B + DJU(s) 


1 —4 ] | pied 
-|7 FIP G6 FIPRCF+2)) | UL) 
The output response for any known inputs u, and u, can be easily obtained from 
Eq. (6-48). 


(6-48) 


6-6 IMPULSE RESPONSE FUNCTIONS OF 
DYNAMIC SYSTEMS 


In this section we obtain impulse response functions of dynamic iari and 
find responses of such systems to arbitrary inputs in terms of impulse re 


sponse functions. m "m 
T Consider first the following linear time-invariant differential equation: 


(n-1) (n-2) > = 
krit +O, x 4e Bde ax (6-49) 


—————— — Ml tá 


mmy Roy pror. 


where the a; (i = 1,2,... , n) are constant. Equation (6-49) can be written as 


X. 0 1 0 eM 0 Xx 0 
s 0 0 1 0 Xi 0 
s PIT ; js PRI HM 
Xn =a sa; “Ong ^4 —, Xn l 
where 
X 
dX Xx 
(n-1) 
p X 
Let us define 
0 1 0 0 0 
0 0 1 0 0 
A= = ^ B Em 
Qn “a —Qn-g -:- —d, l 


With the initial time 7, = 0, the solution of this equaticn can be obtained in 
the following form: 


x(t) = &r)x() + | Dl — 7)B(r)u(r) dr 


where 
D) = [Di D) --- 1,0] 
Qut) pilt) p Qut) 


Palt) y(t) --- Pon(t) 


nlt) Pno(t) PAA Ont) 


= e^ 


[Ð (t), ®,(t),..., 5, (f) are linearly independent.] Notice that if we define 
the eigenvalues of A as Ay, A», ..., An, not necessarily distinct, then the 
dij (5j = 1, 2, ..., n) are linear combinations of the terms like tre 
(A =0, 1,2,..., m, — 1, where m; is the multiplicity of the eigenvalue 
Ni.) Hence for ¢ > 0 the $;, may be differentiated with respect to f as many 
times as we need. 


| 


In terms of the ¢,,(t), x,(¢) is obtained as 


X(t) = $i) O) + $:.0)x«0) + +++ + ó:Gx«QO) 
+ fies — Du) dr (6-50) 
If the initial conditions are zero, or x,(0) = x,(0) = --- = x,(0) = 0, then 
x(t) = ia e(t — tur) dr (6-51) 


Notice that œ,» (t — 7) is the impulse response function of Eq. (6-49). 
To verify this statement let us write the transfer function of the system given 
by Eq. (6-49). 


Aa) 1 
U(s) s -+ as™! + --- + sas + ds 
Let us put 
Ta 1 
5e ES s” T ase + = Saf S Qn-\S + An 
Then 
X,(s) = H(s)U(s) (6-52) 
The inverse Laplace transform of Eq. (6-52) yields 
x(t) = f * h(t — ur) dr (6-53) 
0 
where h(t — 7) — impulse response function 
Comparing Eq. (6-51) and Eq. (6-53) we see that 
O $a(t— 7) = A(t — 7) (6-54) 


Notice that 
$t) h(t) 
$a) a | 
e.t) = : = ‘ 


à (n-1) | 
Panlt) h (t) 
Referring to Eq. (6-50) and Eq. (6-54) the solution of Eq. (6-49) can be 
written as follows: 


x(t) = du (Ox(0) + da (03(0) 4- -+ + di) x (0) 
4 k h(t — 7)u(7) dT 


We can easily extend the analysis made for the linear time-invariant sys- 
tem to the following linear time-varying system : 
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(n) (n—1) 

x bai) po. + BUDE d epee (6-55) Notice that | 
where the a,(t) are absolutely integrable as functions of ¢ in the interval h(r4-, 7) = akt, 7) MEE adc ED | 
to « C t. The response x(t) depends on both the time of application of ái GE Men apt dene 
the input, 7, and the time of measurement, £. The response of this system can a*h(t, T) = 
be given by BENTL ad 

i-T 
x(t) = dut, fy) + dtt, 309) + > + dut) x (5) | and 
, a" "h(t, 7) 
+f A, r)u(r) dr Em m E. 


where 


A(t, T) = dut tnis, t) + dust, toWon(Ts to) s - d s s response function Consider the 
inear time-invariant system defined by 


+ Piste free, to) + +++ + dat, fo), to) 
, * a es 0 
Here the $,,(t, to) and yrin(r, to) are defined by X = Ax + Bu, x(0—) 


m where X — n vector (state vector) | 
Pitts to) Pilt, to) --- inl t, to) u = scalar (forcing function or input function) | 
Pails to) paalt, to) -+ on(ty ty) A=n Xn constant matrix 
e(t, t) = s . ee B =n x 1 constant matrix 
i Let us obtain the response of the system to the unit impulse input. 
j For the unit impulse input 
Lonilt, to) drolt, ty) ros hanlt, to) i É u(t) ues òlr) 
and "is 
= As discussed in Example 6-4 the solution x(t) is obtained as 
Wiil7s to) Wilts fo) ++ WinlT, to) " 
WarlTs fo) Aras, fo) +++ rar, fy) x(t) = e^'x(0—) + f. e^t-? B&(7) dr 
@-! .f) = i 4 k | 
(T. 79] I — e^ B, t = 0 (6-56) 
: : Notice that the state space trajectory will exhibit a Jump at t = 0. 
LYni(Ts 1o) Welt, te)” o  Wan(Ts to) 
If the initi m "— Example 6-8 Obtain the response of the following system: 
the initial conditions are zero, or x(t) = X(t.) = +++ = x (ta) = 0, then z iat DE + abx =u, x0) —3(0—) =0 (6-57) 


the response of the system of Eq. (6-55) to any u(t) can be obtained from 


x(O) = f. At, yu) de 


| 

t 

| where a and b are constants (a = b) and u(t) = S(t) = unit impulse. We shall 
obtain the response of this system by two methods, (1) the conventional Laplace 
transform method; (2) the method discussed in this chapter. 

| (1) The transfer function of the system of Eq. (6-57) is 

| 

' 

| 


The impulse response function A(t, 7) satisfies the following equation: 
X(s) 1 


U(s) s? +(a+ bjs + ab 


h(t, 7) -- a(t) AG) Ter a, (t)? T) 
et The impulse response is, therefore, given by 


or” pri 
+ a(t A(t, T) = S(t — 7), t; &C T t X= 1 1 -~4) 
9-1— 6 s+b 


with 
h(r—,7) = 2, T) CE "A(t, T) = Taking the inverse Laplace transform we obtain 
* E. n-1 Ex 
i e ou x(t) = z petat) 


` (2) Equation (6-57) can be rewritten as follows: 


ie S p e isl hel * h | [a] 


(6-58) 


where X, =X, X% =X 


In order to simplify computation of e^, let us employ the following transformation: 


Pig etel 


Equation (6-58) then becomes 


—b —1 
^ -a O||» a—b a-b||9 
= + P 1 [u] 
" 0 —b i. 
ie " a—b a—b : 
—13 
=a Ojj y 
a—b 
= + i [u] (6-59) 
0 —bl||». wu m 
a—b 


Referring to Eq. (6-56), the solution of Eq. (6-59) to the unit impulse input 8(r) is 
given by 


y(t) = eAB 
where 
— 1 
eu 0 
litus 1 Bolle b 
0 et i 
a—b 
Thus 
et 
y(t) |^ d 
-bt 
o(t i 
y(t) —X 
Hence the solution x is given by 
e^t 
Xi he 1 1 s —L 
-bt 
x —a —b nep. 
" a—b 
al en at e*t 
a a—b a—b 
üg o^ — Dg 
a—b a—b 


and the impulse response of the system is given by 


x(t) = x(t) = (ee + e) 


We shall next consider determination of the impulse response function 
of a linear time-varying system by use of an example. 


Example 6-9 Consider the following system: 
x + a(t)x = &t — 7), x(t <T) =0 


In this system the unit impulse input is applied at time 7. We shall obtain the 
impulse response function of this system by two methods, (1) the conventional 
method, (2) the method discussed in this chapter. 

(1) Let us define the impulse response function as A(t, 7). Then 


2h07) +. aleyhe, 7) = Sle — 7) (6-60) 


OxcTEt 
The homogeneous equation 
OMT) |. a(ryh(t, 7) = 0 
ét 

has a solution 

In [A(t, 7)] = -Í alt)dt + El), >r 
where £(7) is a function of 7. Thus, 

Alt, 7) = Wren tnt, 

where (7) = ef. Since in this problem x(t < 7) is given as zero, we have 


fT 


h(t,7) =0 for Dev 
Therefore we may write 
A(t, r) = rye“ Je", — £2 
=0 ic 
Hence 
Int 7) — aqe — x)n(rje Jtt — aedn(mpen fe", £20 (gi) 
=0 akai: pamm 
Substituting Eq. (6-61) into Eq. (6-60), 
Slt — 7)n(r)e- Jat = (t — 7) (6-62) 


At t — 7 we have 
aren Jar zl 
from which 
n(r) = efacar 
Clearly, Eq. (6-62) is satisfied whenever 7 = t, since &(t — 7) = 0 for 7 Æ t. There- 
fore the impuse response function is obtained as follows: 
h(t,7) = ef oca- faciat 

= eli, acts Jj, aces = e- fraes, tT 

= 0 Dev 
(2) The system equation can be written as 


x = —a(t)x + 9(t — 7), x(t<7)=0 
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The impulse response function can be expressed as 
A(t, T) = p(t, to) Cr, to) 
where ó(f,1,) is the state transition function (one-dimensional state transition 
matrix) and (7, to) = 7"(r, tọ). Since a(t) is a scalar, a(t) and fi a(s)ds commute 
for all t. Hence 
j t 
$(t, to) = elu oes 
Then 
S (s)ds 
VG, to) = $7 (v, 1) = eine 

The impulse response function is then obtained as 

t T t 

h(t, 7) = e7 fee as+ fy awas = e Jas. tT 


We shall next consider the response of the system studied in Example 
6-9 for an arbitrary input. 
Example 6-10 Obtain the response of the following system: 
X + a(t)x = u, AL =0 


where u is an arbitrary input. 
The response is given by 


xt) = [^ ht, uta 
— f. elt a(s)ds+ ik SES rq 
to 


t t T 
= efus ii elu" 9 dr, >n 


6-7 VECTOR MATRIX DIFFERENCE EQUATIONS 


Mathematical representations of discrete-time systems are very often given 
in the form. of vector matrix difference equations. Formal solutions of vector 
matrix difference equations can be obtained by techniques similar to those 
discussed in Sections 6-2 and 6-3 for linear vector matrix differential equa- 
tions. 

In analyzing discrete-time systems, without loss of generality, we can 
assume the sampling period T to be equal to unity. In order to simplify 
notation we shall employ this assumption in the following analysis unless 
otherwise stated. 

The equation with which we shall deal in this section is of the following 
form: 

x(k + 1) = G(k)x(k) + H(&)u(k) (6-63) 
where X — n vector (state vector) 
u-—r vector (control vector) 
k — integer which indicates kth sampling instant 
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G(k) — nx n matrix 
H(k) =” xr matrix 
Equation (6-63) is equivalent to a set of n difference equations of the fol- 
lowing form': 
n T 
x(k -+ 1) = Z gy(k)x(k) + P hi k)uk) 
f - 
We shall first present solutions of homogeneous vector matrix difference 
equations and then give solutions of nonhomogeneous vector matrix dif- 


ference equations. Finally, we shall discuss discretization of vector matrix 
differential equations. 


Homogeneous vector matrix difference equation Consider the 
following linear homogeneous vector matrix difference equation: 


x(k + 1) = G(k)x(k), — x( = X, (6-64) 
(k=h,h+1,h+2,...; h = integer) 


Similar to the case of linear vector matrix differential equations, an n X n 
matrix X such that its n columns consist of n linearly independent solutions 
of Eq. (6-64) is called a fundamental matrix. It is nonsingular and satisfies the 
matrix difference equation 


X(k + 1) = G(k) X(k) 


Notice that n columns of a fundamental matrix constitute a basis of the 


n-dimensional vector space. i 
Consider the unique fundamental matrix (k, A) which satisfies 


Dk + 1,4) = G(A)@(K, h), Oh, 1) = I 


Any fundamental matrix X(k) can be written as @®(k,h)C where C is an 
n X n nonsingular matrix. Notice that ®(k, h) has the property that 


O(h,h) — I 
D(A + 1, h) = G(A) 
(h + 2, h) = GA + 1) GA) 
(h + 3, h) = Gh + 2) G(h + 1) GA) 


Dk, h) = G(k — 1)G(k — 2) --- G(h + 1) G(A) 
Using ®(k, h) the solution of Eq. (6-64) is given by 
x(k) = eK, h)x(h) = B(k, h) x, 


YThe solution of this equation is well known; see, for example (6-8). [Numbers in 
parentheses refer to list of references at the end of the chapter.] 


If G(k) is an n x n constant matrix, which we shall denote by G, then by 
choosing A = 0 and writing ®(k, h) as ®(k) we obtain 


DB(k) = G* 
®(k) is the unique fundamental matrix satisfying 
(k + 1) = GË(k), (0) — I 
The solution to 
x(k + 1) = Gx(X), x(0) = x, 
is therefore given by 
x(k) = G*x, 
Note that if G is a diagonal matrix, or 
G = diag (Ai, Ae, ..., An) 
then the unique fundamental matrix ®@(k) is given by 
@(k) = G* = diag Q4, A5 ..., AK 


Nonhomogeneous vector matrix difference equation Consider 
the nonhomogeneous vector matrix difference equation 


x(k + 1) = G(k)x(k) + H(k)u(&), x(h) = x, (6-65) 
(K=hh+l,h+2,... ; A = integer) 
The solution of Eq. (6-65) is given by 
x(k)=(k, h)x,+@(k, h) X @-( J, AYA j—1)u(j—1) (kh) (6-66) 
where ®(k, h) is the unique fundamental matrix satisfying 
SK + 1, A) = G(k)®(k, A), D(A, h) — I (K=h,A+1,h42,...) 


and H(&) and u(4) are zero for k < h. 


We can prove this by direct substitution. For k > h, we obtain from Eq. 
(6-66) 


x(k + 1) = ®(k + 1, A)x, 
+ Ok --10)3 OU, )HG—Dwj—1) 6-67) 
Noting that E 
(k, h) = G(k — 1)G(k — 2) --- GA) 
Eq. (6-67) can be expressed as follows: 
x(k + 1) = G(k)®(k, h)x, + G(k)@(k, 5) 3, T Jj, NAG — 1)u(j — 1) 


+ Bk + 1, Ok + 1, AH(K)u(k) 
= G(K)X(k) + H(k)u(k) 


Thus, we have shown that Eq. (6-66) is the solution of Eq. (6-65). From Eq. 
(6-66) we notice that the solution to the nonhomogeneous vector matrix 
difference equation consists of two parts: the first is the homogeneous solu- 
tion, the second is a particular solution. 

For the linear time-invariant vector matrix difference equation 


x(k + 1) = Gx(k) + Hu(k), x(0) = x, 
the solution, Eq. (6-66), is simplified as follows: For k = 1, 2,3,... 


x(k) = Gx, + G* X; G^Hu(j — 1) 
= Gx, + 3 Gt Hu(j — 1) 
j= 


This solution corresponds to Eq. (6-21). 


Discretization of vector matrix differential equation A con- 
tinuous-time system which is described by a vector matrix differential 
equation can be approximated by a vector matrix difference equation. Such 
approximation is often employed in practice. In particular, if we wish to com- 
pute system responses to various inputs by the use of a digital computer, 
we must convert a continuous-time system to a discrete-time system. The 
discrete-time solutions must be valid at equally spaced sampling instants. 
In discretizing the continuous-time system we assume the inputs or the 
forcing functions to change only at these same instants. | 

The following material presents procedures for deriving the difference 
equation model from the differential equation model. Consider a system 
whose dynamic behavior is governed by the following vector matrix differ- 
ential equation: 


X = Ax + Bu goo 


where X — n vector (state vector) 
u = r vector (control vector) 
A=n X n constant matrix 
B=n xr constant matrix 


If the components of x are measured only every 7 seconds, the system is a 
discrete-time system. In the following, in order to clarify the analysis we 
shall use the notation KT and (k + 1)T instead of k and (k + 1). 

The linear discrete-time system can be written in the following form of the 
vector matrix difference equation relating the state x at the sampling instant 
(k + 1)T to its state at KT, T seconds before: 


x((k + 1)T) = G(T)[X(AT) + F(T)u(kT)] 
= G(T)x(kT) + H(T)w(kT) (6-69) 
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where G(T) is an n x n constant matrix whose elements are functions of the where G(T) is obtained as 
sampling period T, F(T) is an n X r constant matrix whose elements are 
also functions of T, and H(T) = G(T)F(T). The recursive relationship given 
by Eq. (6-69) allows us to compute the state at time (k + 1)T from the 
state at time AT and the control vector at time KT. To determine G(T) and 


F(T) we may use the solution of Eq. (6-68), or 


G(T) = e^" = E iu — m 


0 ent 


and H(T) is obtained as 
H(T) = ( |? e^ dr) B 


TESSAN 


x(t) = e^'x(0) + e^! É e-^'Bu(7) dr 


We assume that all the components of u(t) are constant over the interval 
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between any two consecutive sampling instants, or u(t) = u(kT) for the kth 


sampling period. Then, x((k + 1)T) can be obtained as follows: 
x((k + T) = e^x(KT) + e^ f 7 e-^rBu(kT) dr 
0 
= exer) + f  e^Bu(kT) dX 


where A = T — 7. (For the derivation of this last equation, refer to Prob. 
A-6-28.) If we define 


G(T) = e^t 
F(T) = (f e^ dr)B - [ee f : e^ dr)B 


H(T) = G(T)K(T) = ( f à e^ dz)B 


and 


then 
x((k + T) = e^'x(kT) + e^ (^ e-^'Bu(kT) dr 


2 ert[x(kT) +f : e-^'Bu(kT) dr] 
= G(T)[X(AT) + F(T)u(kT)] 
= G(T)X(kT) + H(T)u(kT) 


which is Eq. (6-69). Thus Eq. (6-68) reduces to Eq. (6-69). [Notice that Eq. 
(6-69) represents the solution of Eq. (6-68) only at discrete points of time 
t=kT;k =0,1,2,...] 


Example 6-11 Given the following vector matrix differential equa- 


Xi 0 d p: | | 0 | 
= u 
bic eh? La 
obtain an equivalent discrete-time representation of the system. 


An equivalent discrete-time system is given by 


x((k + 1)T) = G(T)x(KT) + H(T)u(kT) 


tion: 


Hence 


be + A 


x,((K + 1)T) [ x dui pec] 


0 e-iT LAREJ 


yat n 
+4 2 — p«r) 


] — e7?T 


In computing G and H we may use the Laplace transform method. 
Assuming u(t) to be constant over each sampling period and shifting the 
origin of the time axis by putting 7 = 1 — kT, we obtain the Laplace trans- 
form of Eq. (6-68) for the period 0 < 7 < T as follows: 


| sX(s) — x(0) = AX(s) + —Bu(0) 
where X(s) = L[x(7)] 
The inverse Laplace transform of X(s) then yields the solution of Eq. 
(6-68). Namely, 

| x) = 2^ [Xi)] 

| = L£~'[(sI — A)']x(0) +g er — A)" | Bu(0) 

| where 

| 

| 

| 


(sI — A) = Aj [cofactor of |sI — A |] 


[s1Y — A] r- 
G(7) and H(7) are thus obtained as 
G(r) =Z [sI — A)7] 
and 
Hao) =2"! E 3 ay |B 


UY IATER 


or more conveniently 
H(z) = [eter — A)"]Bde 
= f G(o) Bde 
0 


To obtain G(T) and H(T), 7 is set equal to T. 
Example 6-12 Consider the following system: 


£D Bo dep 0 
na 0 O 1j|x|+]|0][4] 
Ad. Leg ATE a] i1: 


Let us obtain the explicit forms of G(7) and H(7), where 
X(KT + 7) = G(r) x(kKT) + H(7)u(kT) 
The elements g;;(7) correspond to the elements of 


Z^ [(sI — A)71] 
where 
0 1 0 
A= 0 0 1 
—6 —11 —6 
Thus 
s —l 0 
sI—A=!0 Ss —1 
6 ll s+6 
and 
s(s + 6) + 11 s+6 1 
(sl — A)! = X —6 s(s4-6) s 
—6s —115—6 s? 
where A = (s + Ds + 2)(s + 3) 


Taking the inverse Laplace transform of each element of (sI — A)-! we obtain 
Je-* — Be. e78t — Ba-T — 4e-?t .L $e Jet —e-?* + peT 
G(r) =| —3e7* + 6e~** —3e-*  —$e-7 + Be-27 — 90-37 —ie^' + 2e-2* — 3e-3t 
3Je^" — 12e-*7 + 9e-3"  $e-t— 16e-27 + 2%e-3T Jet de-2 ferir 
The elements of H (7) are found from the expression 
Hr) s. f : G(c)B do 


where 


Thus, 
—ile^* ie — de77 Ld 
H(7) = le^ — e^7 y je? 
—ie^* Ez 2e-?7 bm jest 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-6-1 Prove that if x,(t), x(t), ..., Xm(t) are m solutions of 


(n) (n-1) 
L(x) = x + a(t)x + +++ + asa) + ar) = 0 
then c,x,(t) + cox.(t) + +++ + CmXmlt), where c,c5...,c, are arbitrary con- 
stants, is also a solution of L(x) = 0. 


Solution The linear operation L(x) has the property that for arbitrary 
constants @ and £ and arbitrary functions f and g 


L(af + Bg) = aL(f) + BL(g) 

Hence 
L(eix, + Xg Ho-r F CmXm) = L(x) + CoL(X2) + + +> + Cm L (Xm) 

By assumption, xi, Xə, ..., Xm are solutions of L(x) = 0. Therefore, L(x,) = L(x) 
= ++. = L(x,,) = 0. Hence 

L(eixi + CX» + 22? + CmXm) = 0 
This implies that cix, + CoX. + +++ + CmXm is a solution of L(x) = 0. 

Problem A-6-2 Show that a general solution of a linear differential 


equation L(x) = X(r) is given as a sum of z linearly independent solutions of 
L(x) = 0 and a particular solution of L(x) = X(t). 


Solution Let x,(t) be a particular solution of L(x) = X(t) and y(t) 
be a general solution of L(y) = 0. Then 


L(y + xo) = L(y) + L(x) = X(t) 
Thus, y + x, is a solution of L(x) = X(t). Since y(r) involves n arbitrary constants, 


y + x, also involves z arbitrary constants. Since no singular soiution exists in this 
linear system, y + x, is a general solution of L(x) = X(t). 


Problem A-6-3 Consider a system defined by 
X(t) = Ax(t), X(f) = Xo (6-70) 


where the eigenvalues of A are distinct. Show that a fundamental matrix can be 


given by 
ehilt-tode, eet-tüc, s.e etae 


ehlile,  pAmtac,, vee plns 


X(t) = 


gute, EAEC vse erate 


344 OBTAINING THE SOLUTION OF THE STATE SPACE EQUATION 


and the unique fundamental matrix (z) which satisfies 


Ë) = AP), Pn) =I 
can be given by ' 
Ci Cg ct Cy | [enit | O} fe Ca v €] 
Cor Cop ttt Con ei-to) Coy Cop t** Con 
Dr) = 
Cui Cro ctt Cun 0 eM tto) Cm Cn =e Cnn 
Solution Let us assume the solution of Eq. (6-70) to be 
x(t) = eMt-toc (6-71) 
Then, 
X(t) = XeXt-t9c (6-72) 


From Eq. (6-71) and Eq. (6-72), we obtain 
AeMt-t)e = AgXt-t)c 


or 
(A —ADc— 0 
which may be written as 
an — À Qi ag ce Asn € 0 
Qo; Qo — X ag ce on Co 0 
ies) eee 
si ana Ang ct Amm — X Cn 0 
Equation (6-73) has a nontrivial solution c + 0 if and only if 
|A —AI| — 0 


This is an nth-degree polynomial in X and is the characteristic equation of the given 
system. Notice that for each A, there corresponds an eigenvector c,. The coordinates 
of the vectors c; can be obtained by solving the following n simultaneous equations 
with X = Aj: 

(ari — Aghere + aizi + arC + ee + d;4C5 = O 

Ci + (Gag — Mi) + dosCst + *** + aonni = 0 


GniCji T. Anni F Anges, + +++ + (Gan — Ayden, = 0 
(£e 1,253; 5, n) 
Then, the vectors 
e^ to) CH 
eùt- to) Coi 
X(t) = eile, = 


etc, | 


opp pe ne renee — 
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are n solutions of Eq. (6-70). Since the x,(¢) are linearly independent over any 
interval, the fundamental matrix is given by 


gill e, ete, ento) c, 


gre, pnto Cog eto, 


X(t) 


ll 


ghü-toc eito Cng gt-t)c, 


Cy Cim °°° Cm eoi) 0 
-t 
Coy Cog ttt Con ght-t) 
An(t—t 
Cni Cn ^77 Cnn 0 e^ o) 


Since the unique fundamental matrix (r) which satisfies 
P(t) = A), | $() =I 
where A is an n X n constant matrix, can be written as X(r) times a constant non- 
singular matrix, we obtain the following relationship: 
er) = XIX) 
Cyn Cip ^77. Cin eha-t) 0 Cy Cin 77 Cin 


Cor Cop 777 Con eut) Co, Cop °°* Can 


A,(t-t is 4 
Cua Gig = Capel LO ett) | | Car Cue Can, 


Problem A-6-4 Consider a system described by the following transfer 


function: 
X(s) _ K 
U(s) (s t+ a)(s + b)(s +c) 
Find a few sets of basis functions. 
Solution Examples of a few sets of basis functions are 


1 1 1 
(1) sta s+? ste 
1 1 1 
D  cca3ce-cBG-cTco Gracthy Gra) 
s? s 1 
GO GractDero G-cXaGc8Gco GXaG-B5G-9O 
Problem A-6-5 Are the following two systems equivalent? 
p WI. GB 
O WA G+ a6 +5) 
AS) (s +c) 
O orerar CFO 


cancel the common terms in the numerator and denominator, but they have differ- 
ent basis functions. The basis functions of system (1) are 1/(s + a) and 1/(s + 6) 
| (or their linear combinations); those of system (2) are 1/(s + a) and 1/(s + c) (or 
| their linear combinations.) Two systems are equivalent if the transfer functions 
are the same and if the basis functions of one system are linear combinations of the 
basis functions of the other system, and vice-versa. The present systems are, there- 
fore, not equivalent. 


Problem A-6-6 The elements of the fundamental matrix @(r) with 
(0) = I may be generated by the use of summers and integrators. Consider the 
following sécond-order system: 


X + wonk + wx =u 


which can be rewritten as 


N Š n apai ri ý | I (6-74) 


| where Xx, =x and X. em 


Draw analog computer diagrams for the given system and indicate where thecle- 


Solution These two systems have the same transfer function, if we 
ments of the fundamental matrix d(r) may appear. 


Solution The elements of @(r) can be determined from 


$n dep. U — 1 qrén de 
[à Phe EN Fa E | 


L 


where 


[s s] "lo 1] 


Hence we obtain 


Figure 6-4 Analog computer diagrams for second order system defined by Eq. (6-74). 
(A) Diagram indicating ġıı and $5; (B) diagram indicating $;» and $2». 


t 
ou = IKZ dt+1 
t 
$r = f. hao dt 


Solution From the system equation X; is given by 


$n = NC: $i, — Konpa) dt XQ = —2XQ + 1X, (6-75) 
ev i 2 E : Differentiating Eq. (6-75) with respect to £, 
Pa = J, (otis 280,3.) dt + 1 gom cu dnb opis i 
Figures 6-4(A) and (B) are analog computer diagrams showing where the elements Let us put v = X,. Then 
of the fundamental matrix @(r) will appear. $—tv—0 


Problem A-6-7 Consider the system described by Solving this last equation 


t 
1/2)t? 
v mcele™ 4" = ce 


| 
X 0 1 x, | 
ls] - E. dM] Thus 
j xs (E). = IE = Cj f e0/97* dT + x4(0) 
Obtain the fundamental matrix @(r, 0) with &5(0, 0) = I. : : 
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x(t) = [x dt =e, f, E eum: dr dE + txy(0) + x,(0) 
Since at t = 0 
¥,(0) + x,(0) = 0 


it follows that 
cı = —x,(0) 


Thus we obtain 
x(t) = x(0)[1 — Í i (s ev" dr d£] + tx, (0) 


x(t) = —x(0) | et dr + x. (0) 


Hence, 


1—f ‘ fien drdb t 
@(t, 0) = ss 
-f eum dr 1 
0 


Problem A-6-8 Obtain the fundamental matrix (z) with «b(0) = I 
of the system 


X-—ax 
where a is a constant. 
Solution Let 
xX, =x 
X, — X 
X; = X 


then the system equation becomes 
Fi 05 5T ET AE Xi 
X,|2|0 0 1|| x;|—A l^ 
X, a 0 Oj Lx, 


Computing A?, A’,..., we obtain 


0. 0 d a 
A =a 0 Q9] A: =|0 a 0 
0 


lo a 0 0 
0 aod 0 0 a ao 0 
A'2|0 0 al, A®=la? 0 0, at=l0 æ O 
ar 95. 0 [b a 0 0 0 æ j 


If we define (r) = e^t = [x,| xy! x;], then the column vectors x,, x;, and X3 
are obtained as 


2 
L+H vU ee 
2 3 


a? a? 
aC xy II emt E 
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2 
puppe 


2 
x, = Ee ee x6 


2 3 


AU +5" Pg pee 
x- t+Ẹ" t$ ee 
1 ASI es 
Problem A-6-9 (6—4) Consider a linear system described by 
x = Ax 


where x is an 7 vector and A is an n x n constant matrix. Utilizing the method of 
successive approximations, an approximate solution of the system equation can 


be given by 
x(t) = f’ Axs-i(r) dt + x) 


With the initial approximation x,(t) = x(0), obtain the nth approximation to the 
solution. Show that as 7 is increased to infinity the solution approaches e^'x(0). 


Solution The successive approximations are 


x(t) = x(0) 
xi(t) = (I + At)x(0) 


x,(t) = (1 4+ At+ Att) x(0) 


x,(t) = (1 +At + QAM: Esse J A^ t^)x(0) 


As n — co 
x(t) = lim x,,(t) 
= lim (1 + Ar A AR he ven g F A^r)x() 
ned ' 2! n! 
— ` 1 nen 
= 2, 7A t™x(0) 
= e^*x(0) 
Problem A-6-10 By use of Method 4, Section 6-4, compute e^, 
where 
L2 E 4 
A=|:0 2 0 
QUA 34 


Solution The eigenvalues of A are 2, 2, 1. The characteristic poly- Define 


nomial and the minimal polynomial are the same for this A. The minimal polynomial Bi des ood 
is given by $ $ $ 
: QT i. 
$A) = (2 — X)'*(1 — X) W(t) = ; 
In this example, e* can be written as (n-D (a-b (nn 
: EE É 
M us 4 * 
em = Mol + oA + A? where ds, $», - - -, $n are n independent solutions of the given system. Show that 
Equations for the determination of a, a, and o, are : -ff ait 
i E 1 W(t)--a(t)W(t) 20 or W(t) = W(tye f, ier 
E h oy * . 
Qi + 283, = te^ Solution Let us differentiate W(t) with respect to ¢. 
Ay + yA, + oA? = et (6-76) à à s é $i $2 c Ó 
1 2 n n 
Gy + AAs + aM = eht $ $ e 64 ài $a c Gri 
1 2 n : 
Substituting X, = 2, 4, = 1 into Eq. (6-76), W(th=| : * blo a E * 
Qu + 40s = te” lic (1-1) die Gi -— ean 
1 2 ttt n 1 2 exe." Pn 


a) + 2a, + 4a, = e” 
A +a, + Q = et 


(po 4e Be 
à à, XE e $n 


Solving for a, @,, and o, we obtain 


Qs = 4e! — 3e?! + 2tet oy ms oe 
a, = —4e! + 4e? — 3te* $i ge ce n 
as = et — ett + fett ^x ica - g 
j di gi Q Gn 
Hence, hic 2 c on 
I 0.0 $i = Qe. cms $n 
e^t = (4e! — 3e% + 218?) 0. 1 0 2 í . 
o o O (n=2) (n-2) (n-2) 
i um È 
pus g (n) (n) (n) 
: z [^ pe rae Pn 
+ a $e) 0 2 0 Notice that 
' n (n-1) (n2 “ 
iet $= —a( à; — a1) $i — «++ — ult) be 
4 16 12 2 Hence we obtain 
+(e — e” + teo 4 0 $i $: ems Pn 
Oa: | $i py sje $n 
et 12e! — 12e% + 13te% —det + 4e*t pum : a 
=| 0 2t 0 (n-2) (n-3) (n-2) 
E H 'tn-1) ‘a= "gi-n 
0 —3e! + 3e” e aiid, -cali$r c ad. 
= —a,(t)W(t) 


Problem A-6-11 Consider the system 
Integrating, 
(n) (n-1 ay(t)dr 


x af) x E e + a(t) =0 W(t) = W(ty)e7 St 
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Problem A-6-12 (6-9) Compute e^ where 


Q9 i 0 

f 0 0... 1 
A= 

000 —i 

0 1 0 0 


by use of Sylvester's interpolation formula. 


Solution We must first find the roots of the following characteristic 


equation: 
—X 0 1 0 
1 —X 0 1 
A —AI|— — m 2 = 
| bed oai paie ce Anis 
0 1 0 —X 
The roots are 
NIFI ow —¥ —3-—j V3 Hi 
2 7 Z 2 , 2 


Let us define 
—Ó E = eG — X. 


V cd = —eg LA, 
=y} = eA — , 


ve td = ene LX, 


Sylvester's interpolation formula can be written as follows: 


1 Ay AP oO et 
L Ar ME RE et 
1 As M X er") =—0 
l Bu. AF AE eM 
I A A? A? e^ 
Expanding the determinant.about the last column we obtain 
L X A N La M fe Xr 
ti A A M uL. | Ny AL Od FM Ny AE ON pit | 
LX 4 Ly ues lo XU 
D X; XO | A A A? IY A A* A? 
] A N ON EA M 
Si Ae M apl M Aau 
l Ac RE Od lI Ah X3 
I A. A® A? I A A? A? 
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The preceding 4 x 4 determinants may be evaluated as follows: 


1 Ay AY OM 
1 XA X2 33 
px xL 087904 7 904 7X) 
T3 is x + (Ag — A) Os — M) An — 2) 
4 4 
= —12 

1». M XM 

Ae UND. AS 
x X: hs = (A — D(A — XJ) (A — A4D 
1 ki A? js + (Ag — X3)04 — Ag) (As — Av) 


= (3 —j4/ 3) — 24/3 A +j2/S3 A? + (VS — j3)A* 
Similarly, we can evaluate the other determinants. Then e^' can be written as 
follows: 
eM = =i ll- 3 —jVS)1 + 27S A —j24//3 A! — (V3 —j3) Ae" 
[7G —j4/3)1 2 24/3 A — j24/3 A? + (3 — j3) AT" 
[7G j4/ 3)1 - 24/3 A o j24/3 A! — (SS o j3) Ae 
[TG +i 3)1 — 24/3 A -- j24/3 A? - (Q3 -Fj3)A7]e) 
Grouping real and imaginary terms and using the identity 
e? = cos 0 + j sin 0 
e^t becomes 


A3 
6 


At 


TUS 
e [eov [w3 I + 2A — A3) cos E 4+ (—1 +24? 4-4/ 3 A’) sin | 
serves iy 3 1—2A-+ A) cos + + (I — 2A? + A/ 3 A?) sin |) 
Denoting 
e 3/2)t 4 e7312% =k 
e 3/2) — ge- (3/2) =h 


e^ can be written as 


eM -fif 3 k cos + — h sin : ) + A2h cos 5 + A?2h sin 


$ A*(—heos + + VF ksin$)| 


Since A? and A? are obtained as 


0 00 -1 0 —1 0 0 
0 
nm 0 Lick 0 | perk 1 0 0 
0 —1 0 0 —1 0o 0 —1 
1 0 0 1 0 1 4 0 


e^t has the form shown in the following: 


E m h sin A - : s= 
za as ZI gg vw qe my era aes 
h t de: vit k t hh xvf s+1 
j E 243 cos > + 3 sin-- 29957 Ts Ly sm -- DPE 2)6G— 2) ES wee WJG T 73) 
! dii: h cos: E int EN. ain: Taking the inverse Laplace transform of Pe — A)^!, we obtain the solution x(r) 
i 2/3 2 2 3 3 as follows: 
i h t h £u. X t 
E VT sin WE cos To sin T x(t) 
D 
| f cost —— sik xr) 
1^ 9 wd 32 - 
2 1 e-vzt /zt 1 -/ZAO. 1 vat (0) 
Vv ev LI e T e X; 
3 sin- z cos KW. da E We 2 2 
Sl mt i egi wei UU GN E El ug x,(0) 
ea are e AT Le Er s 
£ cost 23 sin —- — - cost — £ sint 2:2. 2/2. 2 
h k 7 x " Problem A-6-14'- -Draw an analog computer diagram for solving 
-373 cos 2 + > sin ; z 60s ; -= 2 h 3 sin the following simultaneous linear equations: : 
i 11X1 + aX = D; 
By lett 
on k —2p 1X1 + aX = by 
h—24/34 
sin £ =a 
t 
cos 5. = b 
e^! can be simplified into 
pb —qa —pa-+qb 2qb —2qa 
" pa 4 qb pb + qa 2qa 2qb 
e = 
—pa 4- qb —2qa pb —qa —pa —qb 
2qa pa+qb pa—qb pb--qa 


Problem A-6-13 Obtain the solution of 


[ gi [ | [ i 
x 1 1 x 
by use of the Laplace transform method. 


Solution Let us define 
—1 1 
A= 
gs. 
x(t) = e^'x(0) 
where e^ is the inverse Laplace transform of (sI — A)-! which is evaluated as 


E: ax bot 1 i 1 Hg al 
a-a p smil Tap MD 


Then 


Figure 6-5 Analog computer diagram for solving Ax = B. 


1A technique similar to the one presented here can be used for finding the eigenvalues 
and eigenvectors of an n x n matrix A. For detail see (6-70). 
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Assume that 0 < a;; « 1, the b; are real, and 


41 Ai 


#0 


d» Ane 
Solution The solution of the given equation 
Ax — B 


x ai a b 
Ep mum [t] 
X» A21 Age b; 
can be obtained as a steady state solution of the following equation: 
X + A'Ax = A'B 
This vector matrix differential equation is stable and possesses a steady state solu- 
tion, X = A-!B. Figure 6-5 shows a possible analog computer diagram for the 


present problem. The diagram can be easily modified to solve n simultaneous 
equations with any real coefficients 4,; and any real b;. 


Problem A-6-15 Find the solution of 


where 


i B(r) = A()B() + B()A*(0),' - B(t) = B, 


where the elements of A(t) are continuous functions of time. 
Solution The solution is given by 
Bt) = P(t, t)B,o*(r, to) 
where ®(ż, /,) is the solution of the following matrix differential equation: 
FOE n) = AC t), Dat) =I 


To verify that ®(z, ,)B,@*(t, fo) is the solution, use direct substitution. Since the 
elements of A(t) are continuous, the solution is unique. 


Problem A-6-16 Show that the solution of the following matrix 
differential equation: 


dX —AX--XB,  X()- C 
is given by ` 
X =e^C eBt 
Solution Differentiate X with respect to t. Then we obtain 
zx = A eMC eB! | gA(CB eBt 


Since B and e®! commute, we have 
e^'CB eB! = eAtC gBtg 
Hence, 
dX 
ur AX + XB 


or X = e^'C e?' js the solution of the given matrix differential equation with the 
given initial condition. 


ee AES A Ret ed eere POE 
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Problem A-6-17 Consider the following equation: 
A'B + BA = —C (6-77) 


where A is a stable matrix (meaning that the eigenvalues of A have negative real 
parts) and B and C are symmetric matrices. Show that the matrix B satisfying 
Eq. (6-77) can be given by 

B= ix eA"C e^ dt 


Solution Consider the matrix differential equation 


X=AX+XA, XO)=C (6-78) 
The solution of Eq. (6-78) is given by 
X = eA"C e^t 


Integrating both sides of Eq. (6-78) from t = 0 to t = co, we obtain 
[Xa = jJ, ^xà + f7 xa dt 
0 0 0 
or 
X(co) — X(0) — A' (f: Xar) rs (f. xd) A 


Since A is a stable matrix, X(co) = 0. Therefore, we have 


-cw (fna) e (f;xa)A 


Hence, 
B= [ Xdt = js eA"C e^ dt 
0 9 
Problem A-6-18 Consider the following system 
(n)  (n-D : E 
x+ ax ++ +e + an-1ž + a,x = ù + bu 
(n-1) 
where x(0) = x(0) = --- = x (0) =0 
Show that if the forcing function u(r) is given by 
u(t) = 0, t<0 


=g 120 
then the response x(t) is identical with the impulse response function A(t), where 
A(t) satisfies S Xod 
htah+-++ +a,-,h + ah = X(t) 
Solution For the given u(t) we obtain 
u(t) = &(t)e~"* — bez" 
Hence for t >0 
u(t) + bu(t) = &(t)e~** — b e-*t + b e*t 
= &(t)e~ 
= &(t) 


Thus, the response of the system is given by the impulse response function A(t). 


jn a7 9 M READ 


Problem A-6-19 Obtain the response of the following system: 


X(s)_ Xs + 3) kP 
U() G-c-DG-3v  *0-30-0 


where u(t) — 0, t«0 
= et, t>0 


(1) Use the Laplace transform approach. 
(2) Use the partial-fraction expansion technique. 


Solution 
(1) The Laplace transform of u(t) is 
us)=— 
+1 
Hence 
2 


“ntre nT EI + opp tess 
The inverse Laplace transform of X. (s) yields 


x(t) = —2e7! + 4re-t + 2e-*t 


(2) 
XGQ.XG. 4 , -2 
U(s U(s) scil END 
sX(s) —2U() X(s  —4 , 4 
Us) | U() scl1'si2 
Hence, 
X,(s) 1 
U(s) £ * om s+] 
X,(s) 1 
UJ L^ *llz 
Let 
ped alc ie pe 
X,(s) | —4 Y,(s) 
Then 
Yi(s) A N 
U(s) _|s +1 
COP MEE. 
U(s) $42 


and y,(1) and y.(r) are obtained as 


wilt) = LLYN = 


xi = fe~ 


yt) = £-[Ys)] = L- M ab ow 
Then x,(t) and x(t) are found as 


Lone" Lease tao] " L sur e etr 


=t LL p-2t 
Bk cut y iiio: 


Notice that the given transfer function can be written in the time domain as 


follows: , 
X 3X 2x = 20 + 6u 


A state space representation of this equation can be given as 


[l-l lle}+Lo le 


Xo =X; — 2u 


Problem A-6-20 Obtain the response of the following system: 


07 
x EP 2 ] [xum " | | 
etal cella its JP x(0] L0 
where u(t) — 0, t<0 
=e", t>0 
Solution -- Let x = Py where 


a SE 
Then we obtain , T 
«s IGE Ale 
“fo 31) *LÀ)u 


y — Ay + Bu 
The solution of Eq. (6-79) is given by 


y(D) m e^t [yc) ¥ f, e-A"Bu(7)d7 | 
dim ie tA 
Thus we obtain m ; 
Dol" ei ily welll 
A u 
= Md os || e isl 


y(0) = P-'x(0) = 0 


ben E ip E ber ~ = Pe 


or 


where 


Since 


we obtain 
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Since 
1 1 4te7t 
x) = Py =| ll ] 
-1 -2]l -2e* + 26-2 
we obtain 


pid v. | 4te^* — 2e7! + 2e7*t ] 


x(t)) | L—4te-* + 4e- — 4e-2 


Problem A-6-21 Obtain the response of the following time-varying 


system: 
X+kXx=—u 
where u is an arbitrary input. 


Solution The solution of this system is given by either 


X(t) = dt to) (40) + (t, to) |" $7. tu(r)dr 


or 


x(t) = Olt, toate) + f^ Alt, muta 
Notice that 
P(t, 1)ó7 7, to) = A(t, 7) ~~~ 
In this system since ? and fe zdr commute for all t, A(t, tj) can be expressed as 
ét, to) = e Tt 7 4* = e-em- to 
Hence 
PTIT, t) = eC1m-uetm 
Therefore, 
h(t, 7) = e-m-cum, t>r 
=O $m 
The output x(¢) is thus given by 


X(t) = eT D-M k(t) + g- t |i e"? u(r)dr 
‘Problem A-6-22 Show that 
X + a(t)x + b(r)x = 0 
can be reduced to the following form: 
Z+c(t)z=0 
Solution Let 
x = ze- 0/2 i adt — zy 
Then, 
X = yz — la(t)yz 


X = y — a(t)yz — dá(t)yz + 1a*(t)yz 
Hence, 


X + a(t) + b(t)x 
= yU + [b(t) — 2a(t) — ia'*(0)z) 
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The equivalent equation is, therefore, 
E+c(t)z = 


where 


c(t) = b(t) — ált) — £a) 
Problem A-6-23 Show that the solution of the following scalar 
differential equation: 
X+a(t)x + b(t)x =u (6-80) 
x(t) = x(t) = 0 
can be given by 


t 
oaao no f gta 


t xulr)u(rT) 
iG) k W (x11, X12) die 
where x(t) and x(t) are linearly independent solutions of the homogeneous 


equation 
X + a(t)x + b(t)x — 0 


and W is the Wronskian given by 

xu() xu) 
Xa) xu) 
Solution Equation (6-80) can be written as 


ird Ñ = E E T b ] [x] (6-81) 


where x, =x, X, =x 


W (X3, X2) = 


The solution of Eq. (6-81) is given by 
d t 
x(t) = Blt, to) + (t, 1) |, DC, te) Bu d7 


where @(t, f,) is the fundamental matrix satisfying the condition that db(£,, fo) = I, 
and B is given by 


Since x(t,) = X(t.) = 0, we have x, = 0; x(t) is, therefore, given by 
t 
x(t) = ext, to) jn @O-\(7, t) Bud 


Since any fundamental matrix can be written as a product of db and a constant 


matrix C, we have 
is m - js E S Fa 
Xuo ži or Poad Leo Ce 


which is equivalent to 
= xc"! or @7 = CX“! 


8 ROTA 


where 
X= bs = ana C= k a] 


Xu Xi» 
Then x(r) is obtained as 
x(t) = X(1) C7: f. CX-i(7)Bu d7 
= X(t) f X-((7)Bu d7 
Notice that 


z 1 Xi —X12] 
X! = ——_. 
onu ee Xü ] 


Hence 


[se] - pz an 


x2(t) Xu) xj) 


t 1 —X12(7)] 
to Wx, z5l xu(7) | eer 


The solution x(t) of Eq. (6-80) is thus given by 


x1) = x0) = aut) f, 34909 a. 


t Xi (7) u(r) 
xt) f, Wry x) 47 


Problem A-6-24 (6-11) Consider the system 
X = Ax + Bu, x(0) = 0 


where x is an n vector, u is an r vector, A is an n X n constant matrix, and B is an 
n X r constant matrix. Suppose that the control vector u(t) is given by 


u(t) = X u(t) €; 


where €; is the unit vector which has a 1 in the jth place and 0’s elsewhere, and 
uy(t) is given by 


n=l (i) 
u;(t) = ge a;;$(t) 
G0 bn —1:]59 1,2,...., F) 
(i) 
where the a;; are constants and b(t) satisfies the condition that 


PRU C=O enin =p 


Show that 
x(T) = eA? i e-^'Bu(7) dr 
Ep cont = -ÅT i f 
= pA? £ 4; (f. $(7)e dr)A B, | 
where B =[B, B. | --- | BJ 


Solution By simple substitutions we obtain 


Tr - 
i e-^'Bu(7) dr = Y, i e-^'Buj(7)6; dT 
0 j=1 


r n-l T (i) 
Notice that " 
olrenst dr = (f. $(r)e7^* dz) A! (6-83) 


(25100, T,... 52:1) 


We can prove this by using induction on i and integration by parts. Clearly, 
Eq. (6-83) holds true for i — 0. If Eq. (6-83) is assumed true foriO<ci< n— 2), 
then by integrating by parts, and noting that A and e7^ commute, we obtain 


+ Q) [TT T (i) - 
D$ Qe dr oe $c) Ld + Í, $(7)Ae7^' dr 
T (i) T = » £i 
= (f; de^ dr)A = [s d()e7^* dr) A 
Hence, Eq. (6-82) becomes 
T AS EX T n-i T * dt dip AB 
jJ, 5047 = X X, a, ([, 80e ) j 
and we obtain 
T n-l T ee d AiB 
x(T) = e^t [> 2 a; (f $(7)e ) ;] 
Problem A-6-25 Consider the following system: 
x(k + 1) = G(k)x(k) + H()u(4), x(0) = Xo 
(k ='0, 1, 2)... ...) 
Show that the solution of this equation can be given by 


x(k) = G(k — 1)G(k — 2) --- G(Ox, 
+ G(k — 1)G(k — 2) +++ GCOBQ)u() 


Mc: 
+ G(k — 1)H(k — 2)u(k — 2) 
+ H(k — Duk — 1) 


In particular, if G(k) and H(&) are constant matrices, then 


x(k) = G*x, + G*^Hu(0) + G*7*Hu(1) 
+ +++ + GHu(k — 3) + GHu(k — 2) + Hu(k — 1) 


Solution The solution to the present difference equation is given by 
Eq. (6-66). By substituting A = 0 in Eq. (6-66) we obtain 


x(k) = Blk, Ox. + BK, 0) 2; b^, HG — Du — 1) 


Si 
aid @(k, 0) = G(k — 1)G(k — 2) «++ G(0) 
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we have 
Dk, 0) B-'(j, 0) = G(k — 1)G(k — 2) --- G(00)G-(00G-*(1) --- GG — 1) 
= G(k — 1)G(k — 2) +++ GC) 
Hence. 
x(k) = G(k — 1)G(k — 2) --- G(0)x, 
+ G(k — 1)G(k — 2) --- G(1)H(0)u(0) 
— eee 
+ G(k — 1)H(k — 2)u(k — 2) 
+ H(k — 1)u(k — 1). 
If G and H are constant matrices, then 
x(k) = G*x, + G*-! Hu(0) + G*-?Hu(1) 
+ +++ GHu(Kk — 3) + GHu(k — 2) + Hu(k — 1) 
Problem A-6-26 Obtain the solution of the following difference 
equation: 
x(k + 2) + ax(k + 1) + bx(k) — 0 (6-84) 


where a and b are constants and k > 1. The initial data are given as x(0) = A, 
x(1) = B. 


Solution In this solution we shall present a classical method for 
solving difference equations. Let us define 
Ex(k) — x(k 4- 1) 
E?x(k) = x(k + 2) 
(E is a unit delay operator.) Then Eq. (6-84) becomes 
(E? + aE + b)x(k) = 0 
Let us put ‘ 
x(k) = Cq* a : (6-85) 
where C is an arbitrary constant and q is a number to be determined so that Eq. 
(6-85) is a solution of Eq. (6-84). Since 
E*x(k) = E*Cq* = q*Cq* = q'x(k) 
we have 
gt+ag+b=0 (6-86) 
or 
q-—-m or q =m, 
where m, and m, are roots of Eq. (6-86). Hence, 
x(k) = Cymf + C,mi 
The constants C; and C, are determined by 
x(0 =A=C,+C, 
x(1) =B= Cim, T Com, 
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Hence, 
C, = Ars — B 
|^ m — m, 
Am, — B 
os 
m, — ms 


Thus the solution x(k) is obtained as 
—B_, , Am, —B,, 
MO Fm am PR 
Problem A-6-27 Consider the continuous-time system characterized 
by X = u. Discretize the system and derive a vector matrix difference equation for 
the system. 
Solution By defining x, = x and x, = x, the given scalar differential 


equation becomes : 
Xi 0 4 iri E ] 
= + u 
i iF 0 Xs 1 a 


or 
x = Ax + Bu 
The discrete-time equation is 
x(kT -H 7) = G(7)x(kT) + H(7)w(kT), 0cvscT 
where T is the sampling period. The elements of G(r) correspond to the elements of 
-£-1[(sY — A)“ ] 
where 
0 1 
A= 
lo o 
Since 
s —1 
]I—A-— 
5 [o s ] 
and 
1 I 1 
1 3 E F 
(sI — A)! = n = i 
0 s 0 — 
sS 
we have 


1 
G(r) = £-[(s1— A) = | d 
0 1 
The elements of H(7) are found from the expression 
H(7) = f : G(c)B do 
| 


where 


«os; 1]h]-i] 


Hence H (7) is obtained as 


Thus the discrete-time equation is 
x(KT + d k " [I e 
= T. 
bed +7] lo 1 nel "La ] WT] 
At the sampling instants, 7 equals 7. Therefore, we obtain 
xy((k + 2 [o 1] D iT? 
= T 
bei nnllo 1) sn] * Lr rars 
Problem A-6-28 The vector matrix differential equation 


X = Ax + Bu +y (6-87) 


can be discretized and the equation can be written as the vector matrix difference 
equation of the form 


x((K + 1)7) = G(T)x(kT) + H(7)u(k7T) + N(T)WKT) 


Determine G(T), H(T), and N(T). Assume that A and Bare constant matrices, and 
u and v are constant between any two consecutive sampling instants. 


Solution The formal solution of Eq. (6-87) is 
X(t) = etxo + e^t [e-^r[Bu(s) + v(7)] dr 
Att = (k + 1)T, and t = KT we have 
x((k + 1)7) = gU FDE ob eACkKtDT for eras 


-[Bu(r) + v(7)]d7 (6-88) 
and 


X(KT) = etr, + ener ^ e7^*[Bu(r) + v(7)] d7 (6-89) 
Multiplying Eq. (6-89) by e^7 and subtracting from Eq. (6-88), we obtain 
(k+1)7 


x(k + DT) = e^" (AT) + eater (77 e-Ar[Bu(z) + v(7)] dr 


= eATX(kT) + eA? j E e7^ B dt u(kT) + eAT f Set dt W(kT) 


= G(T)x(kT) + H(T)ukT) + N(T)v(kT) 
Hence 
G(T) = eå? 
H(T) = e^7 ji e-^ B dt = f e^' Bdr 


N(T) = eA? freta = [peas dr 


PROBLEMS 


Problem B-6-1 Obtain the fundamental matrix ®(r) with @(0) = I 
of the system 
ae se ad 
where a is a constant. 


Problem B-6-2 Consider the following system: 


X = A(t)x 
where f 
a e-*] 
AB P^ b j 


a, b — constants 
Obtain the fundamental matrix @(r, 0) with @(0, 0) = I. 


Problem B-6-3: Assume that ¢,(t), $.(t),...,,(¢) are any set of 
functions having at least up to the nth derivative with respect to z over the time 
interval 7, « t < t,. Assume also that the Wronskian does not vanish. Namely, 


$i $: — Pn 
dy be cee On 


Wd, go, sey Pn) = s& 


(n-1) (n-1) (1-1) 
s MEL s 


Show that 


W(di, Pos ee. 3 Pn) 


is an nth-order linear scalar differential equation in x with the coefficient of the 
highest derivative term in x equal to unity. 


Problem B-6-4 By use of Sylvester's interpolation formula, compute 


e^t where 
0 1 0 
A= 0 0 1 
—6 —1]l —6 


Problem B-6-5 Consider the following system: 


rei Tr 9 3] [* ] 
b | 2 [ —2 —3 Xa 
Show that the fundamental matrix c(t) with (n) = I is given by 


2e-(t-tà — g-3t-t) e^ 4-1) — e~it) | 
—2e--t) 4. 2e-34t-t) —e7 76) 4 2e7 30-09) 


D(t) -[ 
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Problem B-6-6 Obtain the response y(t) of the following system: 


X; —3 —2 —1 X; 1 
l=] 1 © -1i|lx|-|0 [ta 
E 0^9 -—4]Lx, 0 


y =x + 2x, + 3x; 
where u(t) = 8(r) = unit impulse. 


Problem B-6-7 Prove that the solution of the following set of n 
difference equations 


x(k + 1)= È gux(k) + È Mim Um(k) 


(121,2; 2558) 
is given by 


n k-i m P. i 
x(k) -À $(K)x;(0) + p> >» P $u(k — 1 — s)hy ugs) 
n k-1 m T 
= p2 $:5(K)x;(0) + A A PORT US ==) 
where the $,; are the solutions of the following equation: 
$,;(k + 1) = È Sms) 
with the initial conditions $:0) = 8;; where 8;; = 0 for i + jand à; = 1 fori — j. 
Note that this problem is the same as proving that the solution of 
x((K + 1)T) = G(T)x(kT) + H(T)u(kT) 
where G(T) and H(T) are constant matrices, is given by 
X(kT) = G*(T)x(0) +3 G(T)H(T)u[(k — 1 — s)T] 
(k =0,1,2,3,...) 


(5j —1,2,...,n) 
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Controllability 
and Observability 


7-1 INTRODUCTION! 


In solving basic problems of control of dynamic systems we must know the 
kind and amount of information necessary for achieving a desired control of 
a dynamic system. Consider the system shown in Fig. 7-1. The state and the 
output of the plant or dynamic system are described by the following equa- 
tions: 


X = Ax + Bu 


y = Cx + Du 0-1) 
where X — n vector (state vector) 
u =r vector (control vector) 
y = m vector (output vector) 
À — n X n matrix (transition matrix) 
B-—nxr matrix (control matrix) 


C — m X n matrix (output matrix) 
D — m xr matrix (transmission matrix) 


!For detailed exposition on the subject of controllability and observability see, for ex- 
ample, (7-7, 7-2, 7-3). Figures in parentheses refer to materials listed at the end of this 
chapter. 
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Dynamic System 
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- 


Figure 7-1 Control system. 


A, B, C, and D are assumed to be constant matrices. The first equation 
of Eq. (7-1) gives the plant dynamics or system dynamics. The second 
equation of Eq. (7-1) specifies the output transformation. In Eq. (7-1) 
A characterizes the plant dynamics or system dynamics in the absence of 
u, B specifies how the plant or dynamic system is constrained with respect 
to u, C specifies how the output is constrained with respect to x, and D 
represents the direct transmission from u to y. [If every state x can be 
affected directly by u, the plant or dynamic system is not constrained 
with respect to u. If all the state variables x,(t) can be measured, the plant 
or dynamic system is not constrained with respect to y.] In the control 
System shown in Fig. 7-1, the controller observes the output y(/) and, based 
on this observation, determines the control vector u(t). Basic questions in 
problems of control of the plant or dynamic system may be stated, as follows: 


1. Is it possible to transform any initial state x(ż,) to any desired state in 
a finite length of time by applying appropriate control input? 

2. Is it possible to identify the initial state x(t,) by observing the output 
y(t) for a finite time interval? 


Kalman introduced concepts of controllability and observability and gave 
answers to these basic questions (7-1). Our main concern in this chapter is 
giving a clear explanation of concepts of controllability and observability 
and answering the same questions. 

Concepts of controllability and observability are of fundamental import- 
ance in modern control theory. In fact, controllability and observability may 
be interpreted as necessary and sometimes as sufficient conditions for the 
existence of a solution to most control problems. 

In the following, we present definitions of controllability and observa- 
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bility. Note that hereafter in this chapter we shall call a plant or dynamic 
system which is described by Eq. (7-1) simply a system. 


Controllability A system of Eq. (7-1) is said to be com- 
pletely state controllable if for any t, it is possible to construct an uncon- 
strained control vector u(t) which will transfer any given initial state x(1,) 
to any final state x(t,) in a finite time interval, t, «C 1 < t,. (We may con- 
veniently take the final state to be the origin of the state space.) 

Complete state controllability is neither necessary nor sufficient for the 
existence of a solution to the problem of controlling the output y(t) of the 
system. For this reason it is desirable to define separately complete output 
controllability. Output controllability allows the consideration of systems 
with time delays and with direct transmissions from control input to output 
(7-2). 

A system of Eq. (7-1) is said to be completely output controllable if it is 
possible to construct an unconstrained control vector u(t) which will transfer 
any given initial output y(;) to any final output y(t,) in a finite time interval 
west < kh. EU in 

Although complete controllability is generally a necessary condition for 
the existence of a solution to a control problem where 1, is undefined, a 
stronger property of controllability is needed when the control process is 
confined to fixed finite intervals. Such stronger property of controllability 
is defined in terms of total controllability (7-2). A system is said to be 
totally (state or output) controllable if it is completely controllable on every 
interval tj; 1 «& t. 


Observability A system is said to be completely observable on 
lo S t X 1, if, for every t, and some t,, every state x(t,) can be determined 
from the knowledge of y(t) on t, « t < t,. Essentially, a system is completely 
observable if every transition of the system's state eventually affects the 
system's output. 

From the foregoing definition complete observability means that once 
A and C of Eq. (7-1) are given and the response y(t) over a finite time 
interval is known, it is possible to determine the initial state of the system 
x(t,). Consider a simple example. If the output transformation matrix C is 
given as 


C=[1 00 --- 0 
then in the absence of the control vector or u(t) = 0 we have 
y(t) = x(t) 


In this case, the state variable x,(t) can be measured or observed but x,(t), 
X(t), ..., x«(t) are neither measurable nor observable. If the initial values 
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of state variables x.(to), X3(to), - - -> Xn(to) can be determined on the basis of 
measured values of x,(t) over a finite time interval t, S t < 1,, then the 
system is completely observable. The concept of observability is useful in 
solving the problem of reconstructing unmeasurable state variables from 
measurable ones in the minimum possible length of time. 

Similar to the case of total controllability we define total observability as 
follows: If the system is completely observable for every t and every /, > to, 
the system is said to be totally observable. 


Outline of the chapter This chapter presents a detailed 
exposition of concepts of, as well as theorems on, controllability and 
observability. Our discussions are limited to linear systems. Section 7-2 
presents a mathematical background necessary for discussing control- 
lability and observability of linear dynamic systems. Section 7-3 gives 
an elementary exposition of controllability of linear time-invariant systems. 
Section 7-4 presents detailed discussions of observability of linear time- 
invariant systems. Section 7-5 is concerned with controllability and 
observability of linear time-varying systems. Section 7-6 presents the relation 
between controllability and observability of dynamic systems and the rela- 
tion between the linear dynamic system and its impulse response. 


7-2 MATHEMATICAL BACKGROUND 


In this section we present a mathematical background which we shall use in 
the rest of this chapter. First, we briefly explain the convex set; then we 
define the adjoint system. Use of adjoint systems often simplifies theoretical 
study of optimal control problems. We shall next discuss representation of 
elements in the n-dimensional vector space V relative to a given basis. Finally, 
we shall define Gram matrices and Gram determinants and present a few 
theorems on Gram matrices. 


Convex set A set is said to be convex if all the points of the 
line segment joining every pair of points in the set are also in the set. 

Let us define the set of all the points in the n-dimensional space which can 
be reached from the origin in the time interval tọ < t < t, as I'(t, ¢,). Then, 
It, t,) is a convex set. This can be seen as follows: Suppose that partic- 
ular control vectors u, and u, defined for fẹ Ct < t, drive the output to 
points P, and P,, respectively. Since any point in the n-dimensional vector 
space can be represented by an n-dimensional vector x, let us represent 
points P, and P, by vectors x,(t,) and x,(z,), respectively. Consider any point 
P, on the line segment joining points P, and P,. Let us represent point P, 
by a vector x;,(t,). Then x;(t,) can be expressed as 


X3(¢,) = ¢x,(t,) + (1 — ¢)x,(t,), Occ«l 
Clearly point P, can be reached by use of the control vector u,, where 
u, = cu, + (1 — c)u, 


Hence P, is in I(t), t,). Thus the set I(t), /,) is shown to be convex, since any 
point on the line segment joining every pair of points in I(t), /,) is also in 
Its, 1). 

If we wish to show that the convex set I(t, t,) coincides with the whole 
space V, we need show only that I'(f,, ¢,) is not bounded in any direction in 
V; in other words, the vector x(t,) spans the n-dimensional space V. 


Adjoint system Consider a system described by 
X = A(t)x (7-2) 


where x is an n vector and A(t) is an n x n matrix. We assume that the ele- 
ments of A(t) are absolutely integrable as functions of : in the interval 
t, X t < t;. Suppose that cr, to) is the unique fundamental matrix satisfying 


FIDU, t) = AME), ds n) =I (7-3) 


Then differentiating both sides of 
P(t, t)O-'(t, t) = I 
with respect to t, 


d E d. 
2 (o, 191D, n) + OC, 4) FC, 6) 
= AOE, LONG, t) + dt, AIO“, 1) 
= A(t) + 9(r, 4) C (- (1, 4)] = 0 


Hence, we obtain 


die- =— 9^ AU) 7 0,5)-1 (7-4) 
Let us define 
(to, 1) = B=, to) = W*( t) (7-5) 
Then Eq. (7-4) becomes 
SIH t= — WG DAO (7-6) 


Hence we see that Y*(t, /;) is a fundamental matrix for the system 


y = —yA() 
where y is a row vector. 


By taking the conjugate transpose of Eq. (7-6) we obtain 
diwe 1 = —A*O0YG 6). Pent) =I (7-1) 


We find that W(t, to) is a fundamental matrix for the system 
z= —A*(t)z (7-8) 
Equation (7-8) is said to be adjoint to Eq. (7-2). The matrix differential 
equation (7-7) is said to be adjoint to Eq. (7-3). Conversely, Eq. (7-2) and 
Eq. (7-3) are said to be adjoint to Eq. (7-8) and Eq. (7-7), respectively. 
If the matrix A(t) is skew Hermitian, or 


A(t) — —A*(t) 
then the system 
X — A(t)x 
is said to be self-adjoint. s 
Example 7-1 -- Consider the following system: 
y n (n-1) 
paw + oo bag =0 (7-9) 
Let us obtain the adjoint system. Equation (7-9) can be written as 
x = Ax 
where 
0 1 0 ee 0 
x 
i b. 0 1 0 
Xs 
x= 76 x Ld Bw . . . 
Ws 0 0 B. ss id 
(a-1) 
Xn x =A a dal 669 oi 
The adjoint system is 
y= —AÀ*y 
where 
0 0 0 à, 
i3 iu gw Oey 
Y2 
ee uii 0 das 
Yn 0 Os. = 4, 
or 
» = O5, Yn 
Y» - z Gn-1 Yn (7-10) 


Ýn = —Yn-1 + Yn 
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By eliminating y;, Ys, . . . , Yn-ı from Eq. (7-10), we obtain 


(n) (n-1) a 7 
(=) ys aie F cem SF (—)@n-19n + days = 0 (7-11) 
Equation (7-11) is adjoint to Eq. (7-9). 


Representation of elements in n-dimensional vector space relative 
Notice that from Eq. (7-5) we have 


W(t, to) BU, bo) = I 

which is the same as 
(Wilt, to), (651) = Oy = 1 i=j 
=0 ixj 

(j= ER), 

where W.(t, tj) and ®,;(t, tọ) are the ith column of W(t, tẹ) and jth column of 
c(t, tə), respectively. From Eq. (7-12) and since n columns of ®(t, t,) are 
linearly independent and n columns of-¥ (t, t) are also linearly independent, 
a set of vectors @,(t, t,) (i= 1, 2, ..., n) is a basis for the n-dimensional 
state space V and a set of vectors WV (t, to) (J= 1, 2, ..., n) is a basis for the 


dual state space V’. Therefore, referring to Section 2-3 we can express any 
vector y in V as 


to given basis 


for all t and f, 


(7-12) 


y(t) = È (Vt. ta), YCD) Bt, to) 
This can be seen easily. Since y(t) is expressible as 
y(t) = x a; Bt, to) 
the inner product of Y(t, te) and y(t) becomes 
(YG, to), YO) = (wt 1o), È atit, 4) = 


7 


Hence, 
y(t) = 2: (V. (t, to), ¥(t)) Bi (t, to) for all t and t, 


We shall often use such representation of vectors in the rest of this 
chapter. For example, for a given basis &,, &,..., Ên of the n-dimensional 
vector space V, we shall express any vector y of V uniquely as follows: 


ym b (2: Y)& 


where 3;,2,,..., %n are the dual basis of the basis €,, 8,,..., Ên- 


Gram matrix and Gram determinant Suppose that x,, X, 
...+, X, are elements of an m-dimensional vector snace V Then v n matriv 
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(x,X) (X, Xy) (xi, Xn) 
(x; Xx) (Xs X2) (X, Xn) 
G = (œx) = 
(x,X) (Xm X) (Xn Xn) 


is called the Gram matrix of the vectors X;, Xs, ... , Xn- Its determinant 


|Œ; xj)| is called the Gram determinant. 
Note that the Gram matrix can be represented as A*A, since 


(x, Xj) (x, X) (Xi, Xn) 
(Xa Xj) (Xe, X9) (Xs, Xn) 
G= , : 
Qu, X) (Xw X) (Xm Xn) 
Xj Xsi "s a Xi; Xi diis Xin 
"X. dg 05008 me Xe, Xa °° * Xen 
= = A*A 
Rin Xm c5 X Xmi Xm °° 70 Xman 
where 
Xu Xiz Xin 
Xo Xoo Xon 
7 S ë 
Xmi Xm2 E Xmn 


The n x n matrix G is also called the Gram matrix of the m x n matrix 
A. (Clearly, if A is a real matrix, then A* = A’ and the Gram matrix can 


be written as A’A.) A 7 
We shall next prove that the Gram matrix 1s always positive definite or 


positive semidefinite. 


Theorem 7-1 The n x n Gram matrix A*A of an m xn 
matrix A is positive definite if the rank of A is n, and it is positive semidefinite 
if the rank of A is less than n. 


Proof Consider the Hermitian form 
x*(A*A)x = (Ax)*(Ax) = y*y 
=} + yc + nYn 
PRENNE ENTE SEES 


Notice that x*A*Ax becomes equal to zero if and only if y = Ax = 0. If 
the rank of A is n, then the only vector x that satisfies Ax — 0 is x — 0. 
Hence the Gram matrix A*A is positive definite if the rank of A is n. 

Nonzero vectors x that satisfy Ax — 0 exist if and only if the rank of 
A is less than n. Hence the Gram matrix A*A is positive semidefinite if the 
rank of A is less than n. This completes the proof. 

Notice that any positive definite matrices and positive semidefinite 
matrices can be represented as Gram matrices. (Refer to Prob. A-7-7.) 


Theorem 7-2 A necessary and sufficient condition that real 
or complex m vectors X;, X», . . . , X, be linearly independent is that the Gram 
matrix ((X,, x;)) be nonsingular, or the following Gram determinant be 
nonzero. 


(x, X) (X, Xa) (X,, Xn) 
(X, X,) (Xa, Xp) (Xa, Xn) 
|(xi, xj)| = s i e 
E X) (X. Xə) (Sis X,) 
Proof To prove this, suppose that x,, X», .. . , x, are linearly 


. , €, not all zero such that 
CX, + eX, + -+- FH 04X, = 0 


dependent. Then, there exist constants c,, c,,. . 


Hence, 
m n 
2 cix; xj) = (x, 2 CX) = 0 
j= j= 


This means that column vectors of the n x n matrix ((x,, X;)) become linear- 
ly dependent. Hence, 
| (X, Xj) | = 0 
Conversely, if 
|(X;, X))| = 0 


then there exist constants c,, c,,. .. , c, not all zero such that 


È e x) = 0 (i= 1,2,...,n) 
Hence, i 

à CiXis X CX, = > 2 &c(Xx, x) = 0 
it follows that 


| 
| 
| 


which means that x,, X», ..., x, are linearly dependent. Hence, we have 
proved that a necessary and sufficient condition that vectors x,, X.,..., Xn 
are linearly independent is 


[(x, x)] #0 


which implies that the Gram matrix is nonsingular. This completes the proof. 
(Note that the Gram determinant is nonzero implies that it is positive. See 
Prob. A-7-6.) 


Linear independence of column vectors which are continuous 


functions of time Consider next the following m x n matrix H(z, t): 
hy to) Ins, to) hin(t, to) 
hay(t, to) — Bast, to) hast, to) 


H(t, t) 


hast, to) 
i H,(t, 1,)] 


hat, to) Asst, to) 
[H,(t, to) i H.(t, to) i 


where the functions h;;(t, te) (i = 1, 2,..., m; j= 1,2,...,n) are continu- 
ous functions of t for the interval t, < t < t,. The vectors H,(t, t), H(t, to), 
. ,H,(t, to) are called linearly independent if and only if 


ce, H(t, to) + coH(t, to) + --- + c4 Hart) = 0 (7-13) 


where c,. c», ..., Cn-are constants, implies c, = c, = --- = c, = 0. The 
vectors H,(t, to), H(t, to), ... , Ha(t, to) are called /inearly dependent if there 
exist constants c,, Co,. . . , c, Not all zero such that Eq. (7-13) holds true. 

For time-varying vectors H,(t, t,) and H,(t, to), we define the inner product 
thus: 


(H, H) = (^ Bra. t) HAt, t) di 


In Section 2-3 it was stated that any definition of inner product must satisfy 
the four axioms. Clearly, the present definition satisfies the four axioms, 
namely 

1. (Hi, Hj) = (H; Hi) 

2. (cH;, Hj) = ¢(H;, Hj) = (Hi, cHj) 

where c is a complex number 

3. ŒH; a H;, H, + Hj) = (Hi, H,) + (A, H,) + (Hi, Hy) + (H;, Hy) 

4. (HB, Hi) > 0 for H; + 0 
We shall define the norm || H;|| of H; = H,(¢, to) by (H; Hj)'*. 

The Gram matrix of H(t, tọ) is given by 
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[(H,H; (H, H) --- (H,H,) 
(H,, H,) (H,, H,) eae (H;, H,) 
LOL, H;) (Hn, H;) "n (H,, H,) 
D rü ty ü 
| BrB.dt J JHYHdt se f , Hr H,dt 
ty * ty ü 
|J, Brat f ,HiHudp se Í , HE Hd 
t CT "cu 
| [ , HtHadr f QHEH.de f , HT H,dr 
H"H, H*H, --- H*H, 


, HfH, HfH, ... HfH, 
- f. d x% . 


dt. 


H;iH, HH, --- H*H, 


a 
= y H*(t, to) H(t, to) dt 


Theorem 7-3 A set of vectors H,(t, to), H(t, 1), ... , Ha(t, to) 


> 


where 
hy (t, to) 


hy Xt, to) 
H(t, 1) = . 


hat, to) 
and the elements A(t, to) (i= 1,2,... ,m;j-—1,2,...,n) are continuous 
functions of ¢ over the time interval t, t «t, are linearly independent 
over the same time interval if and only if the Gram matrix 


t 
l3 H*(t, to) H(t, t,) dt 

of the m x n matrix 
H(t, to) = [H,(t, to) i H(t, ty) i "ie 


is nonsingular, or the Gram determinant 


| Hat to)] 


|, H| = | { i P(t, t) Hy(t, t,) dt 
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The proof of this theorem is similar to that of Theorem 7-2 and is, 
therefore, omitted. We shall use this theorem in discussing controllability 


and observability of linear time-varying systems. 


From Theorem 7-3 we can immediately obtain the following result: 
A necessary and sufficient condition for x,(t), x,(t), ..., Xn(t), a set of n real 
or complex valued continuous functions of ¢ defined over the interval 
t, <t < t, to be linearly independent is that the Gram determinant 


= In xf(t) x, (t)dt 


| (xi, xj) 


be nonzero. 


7-3 CONTROLLABILITY OF LINEAR 
TIME-INVARIANT SYSTEMS 


In this section we shall discuss controllability of linear time-invariant 
systems. We shall treat systems of the form 


X = Ax + Bu 
y = Cx + Du 
where x= n vector (state vector) 
u =r vector (control vector) 
= m vector (output vector) (m « n) 


A=nxXn matrix 
B=2n Xr matrix 
C — m x n matrix 
D — m x r matrix 


or their equivalent discrete-time systems. The controllability of linear time- 
varying systems is discussed in Section 7-5. 

In the rest of this section, we first consider complete state controllability 
of a simple discrete-time system and determine an optimal control law that 
will bring every state of the system to the origin in a minimal number of 
sampling periods. Then, we shall consider complete state controllability and 
complete output controllability of continuous-time systems. 

Consider a simple discrete-time linear time-invariant system with a 
single control signal. For time-invariant systems, the initial time /; can be 
chosen zero. Assume that the equation describing the system is given by 

x((k + 1)7) = G[x(kT) + Fu(kT)] (7-14) 
where X — n vector (state vector) 
u — scalar (control signal) 
G =n X n nonsingular matrix 
F — n X 1 matrix 
T — sampling period 


! 


The desired state of the system is assumed to be identically equal to zero for 
all t > KT. The system is state controllable if there exists a control signal 
u(t) defined over a finite time interval 0 «t SAT such that starting from 
any initial state x(0) the state x(t) is zero for t > kT or x(t > kT) = 0. If 
every state is controllable, the system is completely state controllable. If 
this system is completely state controllable, it is possible to bring the arbitrary 
initial state x(0) to the origin in a finite time interval 0 C¢ « KT. 

We shall first make a preliminary investigation for obtaining a necessary 
and sufficient condition for complete state controllability. Referring to Eq. 
(7-14), we have 

x(T) = G[x(0) + Fu(0)] 
By setting X(T) = 0 wecan obtain a set of initial states x(0) from which we can 
bring the system state to the origin in one sampling period. Namely, 
0 = G[x(0) + Fu(0)] or x(0) = —Fu(0) 


If the magnitude of the control signal u(0) is not bounded, then the initial 
state x(0) lying on the vector F can be brought to the origin in one sampling 
period. A set of initial states x(0) from which we can bring the system state to 


the origin in two sampling periods can be obtained as follows: From Eq. 
(7-14) 


x(2T) = G[x(T) + Fu(T)] 
By setting x(27) = 0, we obtain 
0 = G[x(T) + Fu(T)] 

= G[Gx(0) + GFu(0) + Fu(T)] 
Since G is nonsingular, 

x(0) = —Fu(0) — G-! Fu(T) 
Similarly, a set of initial states X(0), from which we can transfer the system 
state to the origin in n sampling periods, can be obtained as follows: 

x(0) — — Fu(0) — G^'Fu(T) — G-*Fu(2T) 

ees E "UE Or— Ty (7-15) 

Let us denote G-:*! F by &, (i = 1, 2,..., n). It can be shown by induction 


that the set I'(g) of all initial states which can be transferred to the origin in 
at most q steps by the application of an appropriate control signal is given by 


T(4) = (xix = T Bié} 


where the £, are arbitrary numbers. Note that if £,., is linearly dependent 
on &,, Ês,- . . , &, then so is Êm for every m > k. Henceif I(k) = T(k + 1), 
then T(k) = T(m) for all m Sk: 

A necessary and sufficient condition for complete state controllability in 
the discrete-time case can be stated in the form of a theorem as follows: 


a t I 


Theorem 7-4 (7-1) A discrete-time system described by Eq. 


(7-14), rewritten as 
x((k + IT) = G[x(kT) + Fu(kT)] 

is completely state controllable if and only if the composite n x n matrix P 
where 

P = [F | GF i --- | GF) 
is of rank n. (Notice that since G is nonsingular this condition is equivalent 
to stating that the composite n X n matrix Q where 

Q-[FiG"Fi- į GF 

is of rank n.) 


Proof Notice first that the matrix Q has rank n means that 
the vectors F, G^'F, ..., G^"*'F are linearly independent. If the set of 
vectors &,, Ên ..., Ên where &; = G-^'*!F is linearly independent, it isa 
basis in the state space of the system. Any initial state x(0) can be written 
as a linear combination of the vectors &,. [See Eq. (7-15).] Hence, every 
state can be transferred to the origin in at most n steps or n sampling pe- 
riods and so the system is completely state controllable. 

Conversely, suppose that only k (where k < n) of the vectors &,, &,,..., 
$, are linearly independent but the system is completely state controllable. 
Then the first k of these vectors are linearly independent and, therefore, 
T(k) = T'(m) for all m > k. Hence IX) is the set of all controllable states. 
But, the dimension of I'(k) is k which is less than n. This contradicts the as- 
sumption that the system.is completely state controllable. Therefore, the n 
vectors &,, &,, ..., Ên must be linearly independent and the rank of Q must 
be n. This completes the proof. 

We shall next consider the problem of finding a control signal u which 
will transfer the state x starting from an arbitrary initial state x(0) to the 
origin in some finite time T. Specifically, we shall solve the problem of 
finding an optimal control law that will bring every state of a discrete-time 
system described by Eq. (7-14) to the origin in a minimal number of sampling 
periods. We shall present this problem and solution in the form of a theorem. 


Theorem 7-5 (7-7) Consider a discrete-time completely state 


controllable system described by 
x((k + 1)T) = G[x(kT) + Fu(kT)] 


where x — n vector (state vector) 
u = scalar (control signal) 
G =n x n nonsingular matrix 
F =n X 1 matrix 
T = sampling period 
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Define 
6&, = G-F 
Every state of this system can be transferred to the origin of the state space 


in a minimal number (n or less) of sampling periods if and only if the con- 
trol law is given by 


(Ime 1,2; 2 orgs) 


u(kT) = — (9, x(kT)) (7-16) 


where 7, is the first vector in the dual basis 7,, 72, 


++» %n Of the basis &,, 
&,,..., n of the n-dimensional state space. 


Proof Let us denote the set of all initial states by I'(m). 
Then at any time KT, any vector in T(m) (where 1 < m < n) can be repre- 
sented in the following form: 


x(kT) = È BGs = È (0 XKT)& 


Suppose that Bm = 0, that is, that x(kT) is in T(m) but not in I'(m — 1). 
Using the control law given by Eq. (7-16) and the relationship Gé;,, = &,, 
we obtain 


x(k + DT) = G $ (n. xk). 


- X Gs XAT IE: = E (7o x(k + 7) 


We thus see that the control law given by Eq. (7-16) is the only control law 
assuring that x((k + 1)7) will be in I'(jt — 1). Hence we have proved the 
theorem. 


Notice that the control law given by Eq. (7-16) satisfies the principle of 
feedback, since the control signal is computed from the measurement of the 
system state x(kT). Note that the control law, Eq. (7-16), may not be physi- 
cally realizable, since constraints may prevent the instantaneous measurement 
of all state variables. 


We shall next extend Theorem 7-4 to the case where u is an r vector. 


Theorem 7-6 A discrete-time system described by 
x((k + 1)T) = G[x(kT) + Fu(kT)] (7-17) 
where X — n vector 
u = r vector 


(state vector) 
(control vector) 
G =n Xn nonsingular matrix 
F=n Xr matrix 

T = sampling period 
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is completely state controllable if and only if the composite n X nr matrix P 
where 
P=[FiGFi--- i G"F] 
is of rank n. (The matrix P has nr columns.) The proof of this theorem is 
omitted since it is quite similar to those of Theorems 7-4 and 7-1. 
We shall next consider the case where the system is continuous-time and 
linear time-invariant. The equation of the system is 
X = Ax + Bu (7-18) 
n vector (state vector) 
r vector (control vector) 
A=nxXn matrix 
B=nxXr matrix 
Complete state controllability of the system can be stated by the following 
theorem: 


i 


where x 
u 


Theorem 7-7 (7-1, 7-4) A continuous-time system described 
by Eq. (7-18) is completely state controllable if and only if the composite 
n X nr matrix P where 


P=([B} AB i --- | A”-'B] 


is of rank n. (The matrix P has nr columns.) 


Proof The proof of this theorem is similar to that of the 


discrete-time case. aah. 

Suppose that the system is state controllable and the initial state x(0) can 
be transferred to the origin of the state space in a finite time interval 
0 « t « T. Since the solution of Eq. (7-18) is 


x(t) = e^[x(0) + [e "Bul 
at t = T we have 
0 = ex(0) + f" e^"Bu(d* 
Noting that e^" is nonsingular, 
x(0) — -f e-^Bu(r)dr 
Referring to Section 6-4 we can write e-^ in the following form: 


p-i 5 
e^ = > Q Ai 
i=0 


dii li M i i n 


where p is the degree of the minimal polynomial of A. Since the œ; are scalar 


valued functions of 7, let us explicitly indicate this dependency on 7 and 
write 


e^ = S alr) At 
Then we obtain 


x(0) = — = f Ý ai,(t)A‘Bu(r)dz (7-19) 


Since u(7) may be written as 
u(r) = X w(r)e, 


where e; is the unit vector which has a 1 in the jth place and 0’s elsewhere, 
Eq. (7-19) can be written as follows: 


p-l 7 T 
x(0) = — $ (J, a2w9)a*B, 
iz0 j=} 0 
where B; denotes the jth column of B. Let us define 
[aiu 67 = By = scalar 
0 
then we have i 
p-1 r 
x(0) s * 2 P. AB; 
iz0 jel 


From Eq. (7-20) we see that any initial state x(0) which can be transferred to 
the origin of the state space is a linear combination of Bj, AB,,..., A?^! Bj 
(j= 1, 2,..., r). Hence if the rank of the matrix Q, where 


Q-[B B, | -++ | AB, | APB] 
= [B ! AB | *-- | A?-'B] : 


is n (meaning that there is a set of n linearly independent vectors which 
consititute a basis for the n-dimensional state space), then the system is 
completely state controllable. If p — n, then Q — P. Therefore we can state 
that if the rank of P is n, the system is completely state controllable. Notice 
that if p <n, the A^B;, where h =p, p+ 1l, ..., n— l, are linearly 
dependent on B;,..., A?^'B;. Hence, in the case of p <n, we have the 
same conclusion; that is, if the rank of P is n, then the system is completely 
state controllable. 

Conversely, suppose that the system is completely state controllable but 
the rank of P is k, where k < n. Then, the set I(K) of all initial states which 
can be transferred to the origin is of k-dimensional space. Hence the dimen- 
sion of T'(&) is less than n. This contradicts the assumption that the system is 
completely state controllable. Therefore the rank of P must be n. This com- 
pletes the proof. 


(7-20) 


Notice that if the system equation is given by 


x = Ax + Bu 
where x is an n vector, u is a scalar, and 
0 1 0 Zr 0 0 
0 0 1 ee 0 0 
A= June dw 
0 0 0 zc 1 0 
— ü, — An- — n? — à, 1 


then it is easily verified that 
B, AB, ..., Ar 
are linearly independent. Hence all systems of this type are completely state 
controllable. Ş 
Example 7-2 Consider the following system: 


Bi 132] a 


Let us examine state controllability of the system. 


In this system 
—3 1| | 0 ] 
= B= 
a > ud j 1 


A - [5 ill }-Lisl 
ane e L5]L 1 1.5 
We see that vectors B and AB are linearly independent and the rank of the matrix 
[B | AB] is two. The system is, therefore, completely state controllable. 


Example 7-3 Discuss state controllability of the following system: 


Hoeg HRL 


Hence, 


(7-21) 


In this system 


Since 
—3 1 


AB - [5 Angh 


we see that vectors B and AB are not independent and the rank of the matrix 
[B | AB] is one. Therefore, the system is not completely state controllable. In fact, 
elimination of x, from Eq. (7-21), or the following two simultaneous differential 


equations 
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X, = —3xX,+% +u 
X, = —2x, + 1.5x, + 4u 
yields 
X, 1.5x, — 2.5x, = ú + 2.5u 

or, in the form of the transfer function, 

As) _ s+2.5 

U(s) (s +2.5)(s — 1) 
It is clear that cancellation of the factor (s + 2.5) occurs in the numerator and de- 
nominator of the transfer function. (Thus one degree of freedom gets lost.) Because 
of this cancellation this system is not completely state controllable. 


Consider the nth-order system whose transfer function is 


— p) 
Si q(s) 
where p(s) and q(s) are of orders m and n (m < n), respectively. If p(s) and 
q(s) have a factor or factors in common and cancellation occurs, then the 
order of the system becomes n — k (0 < k < m) which is lower than n. Such 
a system is not completely state controllable. This can be seen as follows: 
Suppose that the system is completely state controllable. Then the set 
In — k) of all initial states which can be transferred to the origin is of 
(n — k)-dimensional space. The dimension of T(n — k) is less than n. This 
contradicts the fact that the system is of order n. 

A. necessary and sufficient condition for complete state controllability is 
that no cancellation occurs in the transfer function or transfer matrix. 
(Stated differently, the transfer function or transfer matrix has some cancel- 
lations if the system is not completely state controllable.) In the following 
we shall consider in detail complete state controllability in the s domain. 

Consider the linear time-invariant system ` ~ 


x = Ax + Bu, x(0) = 0 (7-22) 
where X — n vector (state vector) 
u — scalar (control signal) 


A — n X n matrix 
B=n x 1 matrix 


Laplace transforming Eq. (7-22) and solving for X(s) we obtain 


X(s) = (sI — A)-! BU(s) (7-23) 
where 
pis) 
1 p(s) 
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and the p,(s) (i= 1, 2, ---> n) are polynomials in s. We shell r^ d 
what we mean by cancellations in the transfer matrix(sl — A) B. = m = 
(sl — A)-! B is said to have no cancellation if and only if es g Em 
p,(s), Pols), - +» país), and |sI — A| have no common €— [came d e 
has a cancellation, then the system cannot be controlled in the : eee 
the canceled mode. The following theorem gives a necessary and su 
condition for cancellations in the transfer matrix. 


Theorem 7-8 (7-5) Consider an n vector (sI — A)-!B, 


where A =n X n matrix 
B=n x 1 matrix 
I= n x n identity matrix 
s = complex variable such that (sl — A)-! exists 


(s1 — A)? B has a cancellation if and only if the rank of 
P — [BL AB | --- 1A?-'B] 


is less than n. 


Proof Let us define 
gp = (sI — A)“'B 
By the use of Eq. (6-38) we can express p as follows: 


Define 


where v, is an n vector. Then 


p = 4 — (1-24) 
i=0 
Suppose that g has a cancellation. Then the numerator of the right-hand side 
of Eq. (7-24) must have the following form: 
n- n-2 r " 
Y sv) = (5 — 5) E (7-25) 
j=0 j= 


where s, is an eigenvalue of A. By equating the coefficients of si (i = 0, 1, 
2,...,n — 1) of both sides of Eq. (7-25) we obtain 


i 
4 
Wu 
i 
[ 
| 
i 
4 
i 
i 
[ 


Then 
Vo F SkY, + siv, + «se + SR 
-(— SeWo) + (sw, — SiW,) + (siw, — SEW.) eee 
+ (RW, — SE Wa.2) + Sk Wns 
=0 (7-26) 
Equation (7-26) implies that 


"n m 
| : i-l -9 ' 
à oA B+ se eA! "B+ --- + s?'a B= 0 
which can be rewritten as 
CoB + c,AB + c,A?*B + ... 


n-i-] 


where = j 
2 c= x Oss 548k 
= 


+ e, ,A77!B = 0 (7-27) 


Since the coefficient of A*-! B is Cn-1 = Qn, = 1, Eq. (7-27) implies that the 
vectors B, AB,..., A?-! B are linearly dependent. Hence the rank of P is 
less than z. Thus we have proved that if ø has a cancellation, then the rank of 
P is less than n. 

We shall next prove that if the rank of P is less than n, then o has a cancel- 
lation. If the rank of P is less than n, then there exist constants vy,. ^ 
'/n- 1, Not all zero, such that ipu, vr 


y B +y, AB + y, A°B+---+ 7, ,A"'B=0 


Since 
B-—(I—A)o 
we have 
n-i n-1 
2j y, A GI — Alp = (sI — A) X A9 = 0 (7-28) 
t=0 
For s such that |sI — A | + 0, Eq. (7-28) implies that 
n=l 
Fray =o 
Using the identities E 
p=p 
Ag = sp — B 


A°p = s*g — AB — sB 


A-o => mig = A^-?B SUE SA"-3B LES Rq us 5^-*B 


TE 


we obtain 
n=l n-1 n-l n=l 
2, y,A'g = Y, qus — p y; AB — s > yA *B 
i= = = i=2 
n=l 
Lo i T ayACHHB — sii, B 
izn-2 
=0 
Hence, 
n-1 n-2 n=l 
ruse = Ds X yACUB 
i=0 j=0 t=j+l 
or 


-2 n-1 
Y s X qACCUB 
giana (7-29) 


D yis! 
= iS i=0 
The denominator of Eq. (7-29) indicates that a cancellation occurred. This 
completes the proof. 
We have shown that the condition that the rank of the matrix 
[B | AB | --- | A?-!B] 
is n is equivalent to the condition that no cancellation occurs in the transfer 
matrix (sI — A)*' B. 


Theorem 7-9 A necessary and sufficient condition that the 
System defined by 
X — Ax 4- Bu 
where X — n vector (state vector) 
u — scalar (control signal) 
À — n X n matrix 
B=n x 1 matrix 


is completely state controllable is that (sl — A)-!B has no cancellation. 


Proof The proof of this theorem follows from Theorems 7-7 
and 7-8. We leave details to the reader. 
Example 7-4 Consider the system defined by the following transfer 
function: 
X(s) .. K(s + a)(s + b) 
U(s) (s-rays + b)(s+c)(s +d) 
The transfer function has cancellations and, therefore, the system is not completely 
state controllable. The simplified state space representation of this system when 
a, b, c, and d are distinct is 


392 CONTROLLABILITY AND OBSERVABILITY 
2; —a 0 0 0 Zi 0 
à||0-b o oOl|lz 0 
Zs 0 0 —c 0 L2 T 1 [i 
2 0 0 0 -—dJj Lz, 1 


Observe that because of cancellations in the transfer function the motions of A 
and z, do not depend on z. (Therefore z, and Z cannot be controlled.) 

From this example we see that the distinct eigenvalue system is completely state 
oo if and only if all the components of the control matrix in the simplified 
State space representation (meaning that the transition matrix is j i 

atrix is in the d 
form) are different from zero. "e 


Example 7-5 Consider the system defined by 


Eb t to 
Us) (s +aP(s Fes +d) 


ae a, c, and dare distinct. The simplified state space representation of this system 


Z —a 1 0 0 ve 0 
À| |9-a 0 ol|lz| Ti 
&| | 0 0 -e olal" 4 /4 
d 0 0 0 —dllz, 1 


This system is completely state controllable. 
Consider next the system defined by 


X(S)_ _K(s + a)?(s + d) 
U(s) (s+aP(s + cys +d) 


where a, c, and dare distinct i i i i 
: 1 . The simplified state space representation of this system 


Z —a 1 0 0 Zi 0 
Al 0 —a 0 0 Ze 0 
Zs 0 0 —c 0 Zi Y 1 [u] 
EA 0 0 0 —d EA 0 


nee gi is not completely state controllable, because of cancellations in the 
Tansier function, or stated differently because the moti 
oti 
Mimi ie ca ons of z,, z,, and z, are un- 
Note that if in the control matrix (the coefficient matrix of u) any of the com- 
voee corresponding to the distinct eigenvalues or corresponding to the last row 
of the Jordan block (for the multiple eigenvalues) is zero, then the system is not 
completely state controllable. 


Com e a y (0) he lin ar tim -1n 
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x((k + 1)T) = G[x(kT) + Fu(kT)] 


or 
x = Ax + Bu 

can be stated in various ways. We shall give below four definitions which are 

equivalent. (7-1) 

1 The system is completely state controllable if and only if a set of 
vectors F, GF, ..., G"-'F ora set of vectors B, AB, ..., A"-! B spans 
the n-dimensional state space. 

2 The system is completely state controllable if and only if there are no 
cancellations in the transfer function or transfer matrix. 

3 The system is completely state controllable if and only if there exists 
a sampled data controller which has “deadbeat” response. 

4 The system is completely state controllable if and only if (a) in the 
Jordan canonical form of G or A no two blocks are associated with the 
same eigenvalues, and (b) in the control matrix (the coefficient matrix 
of u) all the components corresponding to the distinct eigenvalues and 
those corresponding to the last row of each Jordan block (for multiple 
eigenvalues) are different from zero. 


To examine condition (a) note that if two or more blocks in the Jordan 
canonical form of G or A are associated with the same eigenvalues, then the 
minimal polynomial is of lower degree than n. Hence the rank of 


[F | GF i --- i GF] 


or 
[B | AB! --- | A"-!B] 


is lower than n. Thus in such a case the system is not completely state con- 
trollable. Condition (b) follows directly from discussions given in Examples 
7-4 and 7-5. 

Consider next a stable or asymptotically stable system. If the norm of 
u(t) is bounded due to saturation, then it may take longer to bring the state 
x(t) starting from the initial state x(0) to the origin. If, however, the system is 
neither stable nor asymptotically stable, then the region of controllability 
with bounded u(t) will not coincide with the n-dimensional state space V, 
but is a subspace of V. (Refer to Prob. A-7-12.) In general, we can say that 
if the norm of u(t) is bounded, not all initial states can necessarily be taken to 
the origin. The set I',(q) of all initial states that can be transferred to the origin 
in the presence of the constraints on the control vector u(t) is a subset of the 
set I'(g) of all initial states that can be transferred to the origin in the absence 


of the constraints on u(t). 
In the case of a discrete-time system with a quantized control vector u(r), 


| 


only a finite number of specific initial states can be transferred to the origin. 
Often, however, we allow a small region, called 8 region, around the origin 
and consider the system is acceptable if we can transfer the state to this 6 
region in a finite time interval 0 € t € kT. Such a system may be said to be 
6 state controllable. 


We shall next consider complete output controllability of systems in the 
form of theorems. 


Theorem 7-10 The system described by 


X = Ax + Bu 
ae. (7-30) 


where X =n vector 


u=r vector (control vector) 
y = m vector (output vector) 
A =n X n matrix 
B-—nxr matrix 
C — m x n matrix 


is completely output controllable if and 
matrix P where 


(state vector) 


(m « n) 


only if the composite m x nr 


P = [CB | CAB ! CA?B i see | CA"-!B] 


is of rank m. (Notice that complete state controllability is neither necessary 
nor sufficient for complete output controllability.) 


Proof Suppose that the System is output controllable and the 
output y(/) starting from any y(0), the initial output, can be transferred to 
the origin of the output space in a finite time interval 0 < t < T. Namely, 


y(T) = Cx(T) = 0 (7-31) 
Since the solution of the first part of Eq. (7-30) is 
x(t) = e^ [x(0) +f i e^ ^r Bu(r)de 
at / — T, we have 
x(T) — e^ x(0) + r e7^'Bu(z)dr | 
Substituting Eq. (7-32) into Eq. (7-31) we obtain 
y(T) = Cx(T) 
- Ce^' x(0) +f ,€ Buts] ie 
On the other hand y(0) = 


(7-32) 


(7-83) 


Cx(0). Notice that the complete output control- 


lability means that the vector Cx(0) spans the m-dimensional output space. 
Since e^" is nonsingular, if Cx(0) spans the m-dimensional output space, so 
does Ce^"x(0), and vice versa. From Eq. (7-33) we obtain 


Ce4?x(0) = -cev[" e^ ^'Bu(r)dT 
M -cf e*'Bu(T — r)dr 


T 
Similar to the case of Theorem 7-7 we can express L e^ Bu(T — 7) d« as 


a sum of A!B, as follows: 
i e*Bu(T — 7)dr 
0 


where 
T 
v«— | &ie)u(T — dr = scalar 
0 
and a@,(7) satisfies 


part * a(r) Ai (p: degree of the minimal polynomial of A) 
i=0 


and B, is the jth column of B. Therefore, we can write Ce^? x(0) as 
-1 T p 
Ce^'x(0) = — Y; X y,CA'B, 
i=0 j= 


i i i i bination of 
E this last equation we see that Ce” x(0) is a linear com 
CA'B, (ju 0, 1 [ p—1;j1,2,...,r). Note that if the rank of 
j a So 4952939 i > 
Q where 
Q = [CB | CAB; CA?B | --- | CA?"'B] 
is m, then so is the rank of P, and vice versa. [This is obvious if p = n. If 
p «n then the CA"B,(where p << h <n — 1) are linearly dependent on 
CB;, CAB,, ..., CA?-!B,. Hence the rank of P is equal to that of Q.] 
If ihe rank of P is m, then Ce^" x(0) spans the m-dimensional output space. 
This means that if the rank of P is m, then Cx(0) also spans ei m-dimensional 
i llable. 

output space and the system is completely output contro 

[DM lg suppose that the system is completely output controllable 
but the rank of P is k, where k < m. Then the set of all initial outputs which 
can be transferred to the origin is of k-dimensional space. Hence the dimen- 
sion of this set is less than m. This contradicts the assumption that the system 
i i he proof. 
is completely output controllable. This completes t 

From Theorem 7-10.it can be immediately proved that, in the system of 
Eq. (7-30), complete state controllability on 0 < t <T implies irc 
output controllability on 0 < t < T if and only if m rows of C are linearly 
independent. 
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Finally, we shall present a theorem on complete output controllability 
of a system involving a direct transmission from control input to output. 


Theorem 7-11 (7-2) A continuous-time system described by 


x = Ax + Bu 
y = Cx + Du 
where X — n vector (state vector) 


u =r vector (control vector) 

y = m vector (output vector) 

A=n X n matrix 

B=nxr matrix 

C=m X n matrix 

D =m x r matrix 
is completely output controllable if and only if the composite m x (n + 1)r 
matrix P where 


P = [CB | CAB i CA’B i—-- ; CA”-'B i D] 
is of rank m. UT 
Proof The proof is similar to that of Theorem 7-10. We 
leave details to the reader. 
Note that D indicates the direct transmission from u to y. The existence of 


such a direct transmission from u to y does not affect state controllability, 
but it always aids output controllability. 


7-4  OBSERVABILITY OF LINEAR TIME-INVARIANT 
SYSTEMS i 


The notion of observability of a dynamic system is associated with the 
processing of data obtained from observations on the system. The basic 
question is: given a mathematical model of a free dynamic system, x = Ax, 
and the output transformation, y — Cx, what is a necessary and sufficient 
condition for us to determine the system state at any time t by observing the 
output over a finite time interval f, < £ < t, + T? (If the linear time- 
invariant system is observable in a time interval 0 < t < T, then it is obser- 
vable in any time interval 0 < £ < 7,, where T, > 0.) The answer to this 
question is given as theorems in this section. 

Notice that in discussing observability of systems we need consider 
only free dynamic systems. The reason for this is explained as follows: 
Consider a linear continuous-time system described by 


I 


€ M —— MÀ á—— — 
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x = Ax + Bu (1-34) 
y = Cx 
where x=n vector (state vector) 


u =r vector (control vector) 
y=mvector (output vector) (m<n) 
A=n xn matrix 

B=nxr matrix 

C= m x n matrix 


The system is said to be completely observable if and only if the initial state 
x(0) can be determined from measurements of the output y(t) over a finite 
time interval 0 < t < T. Since x(t) can be determined from Eq. (7-34) as 


x(t) = e^x(0) + | e^c-"Butod* 


we obtain 
t 
= Al At-D Bu dr 
y(t) = CeAtx(0) + J eM Bul) 
Since matrices A and B are known and u(t) is also a known quantity, the 


integral term on the right-hand side of this equation is a known quantity. 
Therefore, this quantity may be subtracted from the observed value of y(t). 


Hence we may write 
t 
RE " A«-?Bu(7)d7 = Ce^'x(0) 
Fe) = y(t) — Cf eA Bulr)dr 


Therefore, for investigating a necessary and sufficient condition for obser- 
vability it suffices to consider the following system which has no input u(t). 


x = Ax (7-35) 
y = Cx 


where x, y, A, and C are defined in Eq. (7-34). 


x ((A+ D7) x(K7) 


T Seconds 
Delay 


Figure 7-2 Linear discrete-time system. 


| 
| 
| 
| 


| 
| 


Consider the system shown in Fig. 7-2. To simplify our analysis let us 
assume the system has a single-output. Equations of the system are then 
given by 

x(k + 1)T) = Gx(kT) 
y(kT) = Cx(kT) 
where X(KT) = n vector (state vector) 
X(kT) = scalar (output signal) 
G =n Xn nonsingular matrix 
C=1Xn matrix 
T = sampling period 


(7-36) 


Before we present theorems on observability, we shall discuss some preli- 
minary material. Notice that the output signal (AT) is given by 
y(kT) = Cx(kT) = (C*,x(KT)) | (k = 0, 1,2, ...) 
where * denotes the conjugate transpose. x(kT) and x(0) are related by 
x(kT) = G* x(0). Therefore, we can write y(kT) as 
W(KT) = (C*, G* x(0)) = ((G*)* C*, x(0)) 
Let us put 
(G*)* C* = ot 

Note that the o£ are column vectors. If this system is completely observ- 
able, then the initial state vector x(0) can be determined from n observations 
of the output. 

Suppose we take n observations y(T), y(2T), ..., y(nT), then n com- 
ponents of x(0) can be determined by solving the following n simultaneous 
equations for x,(0), x,(0), . . . , x,(0): 


WkT) = (pt, xO)  (k—12,....,n) 
or 


PXO) + 12x40) + ++ - + PinXn(0) = y(T) 
PoX,(0) + P22 X(0) deines Qs X (0) = yQT) (7-37) 
PniX (0) T Pn2X(0) ee PrnX,(0) -—- y(nT) 
where Pris Pros - - - » Pen are n components of the row vector øx. [Notice that 
if we take more than n observations y(T), y(2T), ..., we can obtain more 
than n equations of the form of Eq. (7-37). The number of linearly 

independent equations, however, is at most n.] 
We shall next consider a necessary and sufficient condition that Eq. 


(7-37) can be solved for x,(0), x,(0), ..., x,(0). Equations (7-37) can be 
rewritten as 


ex=y (7-38) 


| 
PN 


where 
Qu Pir Qin x,(0) y(T) 
Qon Qm ^^ Qmm x,(0) yQT) 
e|, : nais ESI a = : 
Qn Qn» `t? Pnn x,(0) y(nT) 


Theorem 7-12 A system of linear equations defined by Eq. 
(7-38) possesses a solution if and only if the matrix ® and the augumented 
matrix [® | y] where 


Pir “Piz ~ Øm (T) 

Pr P "^ Pon yQT) 
[Pi y=] . ; : , 

gi Pno ^O Pun y(nT) 


are of the same rank. Moreover, the solution is unique if and only if rank 
® = rank [6 | y] = n. 


Proof Suppose that rank  — rank [P ! y =r (1&7 € n). 
We can assume, without loss of generality, that the first r columns of @ are 
linearly independent. Then, the first r columns of [ | y] are also linearly 
independent. Since the rank of [@ i y] is assumed to be r, y must be a linear 
combination of the first r columns of ®, or 


Pu Pi: qur 
Lr Par 
y= Ci n + Co ; = + Cr 
Pn Qn» Pnr 
By choosing 
x,(0) € 
x,(0) €» 
x=| x(0 |=] cz 
xz (0) 0 
Xn(0) 0 


Eq. (7-38) can be satisfied. If r — n, the vector x which satisfies Eq. (7-38) 
is unique. 
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Conversely, if Eq. (7-38) possesses a solution, then we can write Eq. 


(7-38) as follows: 


qui qus . Pin WT) 

Pu P22 Pan y(2T) 
x,(0) - + x,(0) : + cae ae x,(0) 3 x s 

n Prz Pan y(nT) 


Clearly y is a linear combination of the columns of ®. Hence 
rank ® > rank [d y] 

Since, however, ® is a submatrix of [Ẹ i y] we have 
rank ® < rank [o | y] 

Hence, 
rank ® = rank [6 ; y] 

Moreover, if Eq. (7-38) possesses a unique solution, then the column 
vectors ®,, ,,..., Dn where [Ð | 6, i---; ®,] = Pare linearly indepen- 
dent, or rank @ is equal to n. Hence the theorem. ----- 

Notice that if y — 0, then Eq. (7-38) becomes a homogeneous system, or 
€x = 0. If the rank of ® is n (namely, if ® is nonsingular) then the homo- 
geneous system possesses a unique solution x — 0, the so-called trivial 
solution. If, however, the rank of ® is less than n (namely if ® is singular), 
then the homogeneous system possesses nontrivial solutions. 

A system of equations is said to be consistent if it has at least one solution. 
Otherwise it is said to be inconsistent. In the system of Eq. (7-38), the system 
is inconsistent if rank ® < rank [6 ! y]. (The system possesses no solution.) 
If rank ® = rank [® | y] < n, then the system is consistent but has infinitely 
many solutions. B f 


Example 7-6 Consider the system @x = y where 


T2 3 Xi 3 
e-l 4 j| ZH ZH 
2 2 3 Xs 5 
The augmented matrix [@ | y] is given by 
I 263.3 
[iy] = I 4 6 4 
2. 2:53.75 


Simple computation reveals that the rank of «p is 2 and that of [db | y] is also 2. 
Hence the system is consistent. But the system possesses infinitely many solutions. 


In fact, 
xi 2 
x= H = |i — 
Xe a 
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where a is an arbitrary constant. y 
Now we are ready to present the following theorems on observability. 


Theorem 7-13 The system described by Eq. (7-36), rewritten 
x((k + 1)T) = Gx(kT) 
y(kT) = Cx(kT) 
where x(kT) = n vector (state vector) 
y(kT) = scalar (output signal) 
G=n x n nonsingular matrix 
C=1 xn matrix 
T = sampling period 
is completely observable if and only if the composite n X n matrix P where 
P = Ics i G*C* i eter i (G*)*-!C*] 
is of rank n. 


Proof  y(kT)can be written as 
ykT)-(ptx0)  (&—1L2....?) (7-39) 


where ot = (CC 
i = linear combinations 
otice that the o£, where k = n + l,n+2,..., are 
E 9i, t. Hence we need consider only n values of x«T).] From 
Theorem 7-12 it follows that Eq. (7-39) possesses a unique solution x(0) if 
and only if the matrix ® where 


qu gie c gue Pa 
Pa Pz “°° m 9: 
=j. x : = z 
Qu Pno ^'^ mn Pn 
is of rank n, or " 
Qu $n ^'^ Pm 
Piz Paz TAUR Pre 
pt) - : z 
Pin Pon coo Pan 
= [př | pž i- i pa] 


= [G*C* | (G*C* | -+= 1 GC] 


is of rank n. Since G is nonsingular and 
* = [G*C* | G*G*C* i --- | G'(G*)""C*] 
— G*IC* | G*C* i --- 1 (G*y"C*] 


rank ®* = n implies that P = [C* | G* C* | --- 
n. Hence, the theorem. 


Notice that this condition is equivalent to the condition that the com- 
posite n X n matrix Q where 


| (G*)"-! C*] is of rank 


€ 
CG 
Q=| ------ 
CG 
is of rank n. Notice that C, CG, ..., CG*-! are row vectors. 


Since G is nonsingular the condition just obtained is also equivalent to 
the condition that the composite n X n matrix S where 


S = ((G*)*'C* i (G*)C* i -.. | (G*)7C*] 
is of rank n, or the composite n x n matrix T where 
CG"! 


is of rank n. Note that CG-', CG-,..., CG-* are row vectors. 


Theorem 7-14 The system described by 
x((k + 1)T) = Gx(kT) 
y(kT) = Cx(kT) 
X(kT) =n vector (state vector) 
y(kT) = m vector (output vector) 
G-—n*x n nonsingular matrix 
C — m x n matrix 
T — sampling period 
is completely observable if and only if the composite n X mn matrix P where 
P= [C* i G*C* i ose} (G*y-!C*] 


where 


is of rank n. 


Proof The proof is similar to that of Theorem 7-13. (It is 
also quite similar to that of Theorem 7-16 presented later.) We leave the 
proof to the reader. 


We shall next consider observability of continuous-time, linear time- 
invariant systems. 
Theorem 7-15 The system described by 
x= Ax 
y= Cx 
x = n vector (state vector) 
y = scalar (output signal) 


A=n Xn matrix 
C=1 xn matrix 


is completely observable if and only if the composite n X n matrix P where 
P = [G^ i A*C* i ms i AC] 


where 


is of rank n. 


Proof © First, note that if the system is completely observable, 
then the exact value of x(0) is determinable from measurements of y(t) over 
a time duration 0 « 1 < T. Since 

x(t) = e^' x(0) 
and e^! can be expressed as 
ec = S a, (t)A* 
k=0 
where p is the degree of the minimal polynomial of A, we obtain, 
-1 
-x(t) = z a; (t )A*x(0) 
Hence y(t) can be written as 
p-! p-1 . 
y(t) = €x(t) = E a.(r)CA*x(0) = 2; av(X(A*C*, x) 
Let us put 
(AX) C* = ot 
Taking the inner product of a(t) and y(t), we obtain 
-1 
(ar). 9) = Fst), ee) Gp? XO) 


€ (a(t), a(t)(ps, x(0)) 
+ (a(t), & (t) (pi, x(0)) 
+ eee 
+ (at). ats (0) (9$: x(0)) 


(a0). 0) = f. adt 


(7-40) 


(Je E usu — 1) 


where 
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(a0), et) = f" atas 


Equations (7—40) are a set of p equations in p variables (gf, x(0)), (of, x(0)) 
et . (93, x(0)). Rewriting in the form of a vector matrix equation we 
obtain á 


(aos at) (ao, 0) (a, Qp_1) (pe, x(0)) (a, y) 
(a, e) (a, a) (a, Qp) (pf, x(0)) (e, y) 
(25-,, 0.) (a,.,, Q) (5, 0t.) (957. x(0)) on 
(7-41) 


where a = o(t) (k = 0, 1 — i 

; > 5 ..-, P — 1) and y = y(t). Since the a,(t are 
d usi pe of each other, the Gram determinant | (a, a;)| s det. 
ive. Hence Eq. (7-41) can be solved for (o* r j 

x(0)), giving (gi > x(0)), (pi , x(0)), LOCO E) (pe, 


(ps , x(0)) = B, : 
(př, x(0)) = B, 
(p3-1, x(0)) = £p; 


Equations (7-42) are p simultaneous linear equations in x,(0), x;(0), . 
x«(0). "eer Theorem 7-12 it follows that Eq. (7-42) can be solved 
uniquely for x,(0), x.(0), ..., x,(0) if and I p= 

iin = ; (0) nd only if p = n and the rank of 


[pr i při --- | pti] 
= [CF ATGA] 
is n. This completes the proof. 


(7-42) 


- K (A*)n-1C*] 


Theorem 7-16 The system described by 
x= Ax 
y = Cx 
where X — n vector 
y — m vector 
À-—n X n matrix 
C — m x n matrix 


(state vector) 
(output vector) (m « n) 


is completely observable if and only if the composite n x mn matrix P where 
P = [C* i A*C* i i (A*-1C*] 
is of rank n. 


tme mre ae CE Sree 
ernie 
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Proof We shall give only the outline of the proof and shall 

leave details to the reader, since it is quite similar to that of Theorem 7-15. 
In the present system the matrix C* has m columns, or 
C*-—[Ct i Cri -- 1 Cy) 


The output vector y(t) has m components, its ith component being 
-1 
WO = Eo()(A*Chx0) (1 Kim) 
t-0 


where p is the degree of the minimal polynomial of A. Notice that taking the 
inner product of a,(t) and y(t), we obtain pm equations in pm variables 
(CE x(O (A* Cr xO (CA 2-* CE x(0));. 1 Sy 2, 705 Noting 
that the a,(t) are linearly independent, we can solve the set of pm equations 


in pm unknown variables, giving 
((A**Cf, x(0)) = B. 
which can be rewritten as 
oii (0) + Por2X2(0) + +++ + PoinXn(0) = Boy 
QonXi(0) + Por2X2(0) + --- + PornXn(0) = Bos (7-43) 


Q»-1niX(0) + Pp-1m2X2(0) +--+ + Pp-imnXn(0) — By. 1n 
» Prin are n components of the row vector Øx: and 
gi = (A*) C? 
Equations (7-43) are a set of pm equations in n unknown variables x,(0), 


x,(0), ..., x,(0). These equations are necessarily consistent. They have a 
unique solution if znd only if the rank of the pm x n coefficient matrix 


where Pkiis Pki- ++ 


Pour Pore DEN Poin 
Porn Po22 ix Pozn 
$= , , , 
Qp-imi . Qnp-im2. *** Qp-imn 
is n. The conjugate transpose of can be written as 
Qoi Pon *** Qp-1m 
Qoi» os» *** Pp-ime 
Dr = ; : 
Qon Poon *** Po-imn 
= [på i pù! e | Prim] 


= [Cr i Cr} | (A*"! Cz] 
Hence, the initial values x,(0), x,(0), ..., x,(0) can be determined uniquely 
if and only if the rank of the following matrix: 


| 


ers [CF tc | i (A*)"! Cz] 


= i Bs ! A*C* i i (A*)»1C*] 
is n. If p—n, then this condition can be stated that the rank of P where 
P = [C* | A*C* | LACY 


is n. Notice that if p < n, then the columns of 
(A*)^C* (h=p,p+1,...,2—1) 


are linearly dependent on Cx, Cr, ..., (A*-! C*. The condition of 
complete observability in the case of p — n can still be stated that the rank 
of P is n. This completes the proof. 

It is noted that if the rank of P is n, then there is a set of n linearly in- 
dependent vectors among the nm columns of P. These vectors form a basis of 
the state space of the system. Hence the initial state X(0) can be determined by 
measurements of y(t) for a finite time interval. 


Example 7-7 Examine Observability of the following system: 


Xx, 0 1 0 x, 0 
ž% =] © 0 1 |} x o| 
X; —6 —11 —6 xs 1 
X 
y=[20 9 ix 
Xs 
In this system 
0 1 0 
Az 0 0 1 and C-[09 1] 
—6 —11 —6 
Thus, 
20 
C*2cC-|9 
1 
00 -—6]r[20 —6 
A*C*—A/C' —-|Tt 0 -Ti 9: |== 9 
01 -—6 1 3 
0 -—6 367/20 —18 
(A*)?C* —(A'?C'—|0 —11 60 9 |=| —39 
1 —6 25 1 —9 


It can be shown that the three vectors C', A'C', (A^; C' are linearly independent. 
Hence the rank of the matrix [C | A/C' i (A)! C] is 3. By Theorem 7-15 the system is 
completely observable. 


Example 7-8 Examine observability of the following system: 


x 0 1 olx 0 
X. |= 0 0 1}} x. |+] 0 | [uJ 
XS —6 —11 -6)Lx,; 1 
X 
y=(4 5 x 
Xs 
In this system, 
4 
e-e-[s 
1 
00 -6][4 —6 
ewe 0 —1l s|- s 
F 0: 1 —6 1 —1 
0 -6 36][4 6 
(A**?C* —(A)?C —|O —11 60|]| 5|-| 5 
1 —6 25 I —1 


It can be seen that 
(A; C' = (—6)C' + (—5)A'C' 
We find that the rank of the matrix 
[C i A'C' i (A*C'] 
is two. The system is therefore not completely observable. This can be seen clearly 
in the block diagram of Fig. 7-3. The two factors (s + 1) cancel each other. Hence, 


X 
xi "d^ iR, um ^ (s+!) (s+4) 
(s+1) (s+2) (S--3) 


Figure 7-3 Control system of Example 7-8. 


measurements of y(t) do not yield estimation of all the initial values of the elements 
of x(t). In other words, there are nonzero initial states which cannot be determined 
from measurements of y(t). 

We can characterize the notion of observability in the s plane. Consider 
the following system: 


x = Ax; x(0) — x, (7-44) 
y= Cx 
where x — n vector (state vector) 
y — scalar (output signal) 


A=nXn matrix 
C—1*xn matrix 
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Taking Laplace transforms of Eq. (7-44) and simplifying we obtain 
XO = (GI — A)-! x, 
Y(s) = CX(s) 
Elimination of X(s) from Eq. (7-45) results in 
Y(s) = C(sI — A)! x, 


(7-45) 


where 
C(sI — A)! = rA n q) ::: 9n(s)] 


and the g,(s) are polynomials in s. 
If C(sI — A)- has a cancellation, namely the polynomials q,(s), g(s), 
-> Gn(S), and |sI — A| have a common factor, then this canceled mode 
cannot be observed in the output y. The following theorem relates complete 
observability and s plane representation of a linear time-invariant system. 


Theorem 7-17 A necessary and sufficient condition that the 


system : 
X = Ax + Bu 
p= Cx 
where X = n vector (state vector) 
u — scalar (control signal) 
y — scalar (output signal) 


A=nxXn matrix 
B=n X 1 matrix 
C= 1X n matrix 


is completely observable is that C(sI — A)-' has no cancellation. 


Proof Notice that C(sI — A)-' has a cancellation if and 
only if [C(sI — A)-!']* has a cancellation. Since 
[C(sI — A)-']* = (sI — A*)*!C* 
the proof of this theorem follows immediately from Theorems 7-8 and 7-15. 


7-5 CONTROLLABILITY AND OBSERVABILITY OF 
LINEAR TIME-VARYING SYSTEMS (7-2, 7-3) 


In this section we shall discuss controllability and observability of linear 
time-varying systems. We first present theorems on complete state control- 
lability and complete output controllability of time-varying systems. Then 
we shall present a theorem on complete observability of time-varying systems. 
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Theorem 7-18 The linear time-varying system 
x = A(t)x + B(t)u (7-46) 
where ( == n vector (state vector) 
u = « vector (control vector) : 
A(t) — n x n matrix, elements of which are piecewise 
continuous functions of time 
B(t) ^ n x r matrix, elements of which are piecewise 
continuous functions of time 


is completely state controllable on to « t < t, if and only if the Gram 


matrix W(t), ¢;) is nonsingular, where 

Wüst) = iM ext, t)B()B* (9t. £)dt (7-47) 
and ®(t, t) is the unique f undamental matrix satisfying 

A et, 5) = AODH te), Pltos fo) = 1 
(Since the Gram matrix is positive definite or positive semidefinite, if it is non- 


singular, it is positive definite. Notice that the Gram matrix W(f,, ¢,) is Hermi- 
tian. If A(t) and B(t) are real, then B(f,, t) is real and W (t,,1;) becomes real 


symmetric. ) 


Proof The solution of Eq. (7-46) is given by 
ti 
x(t) = Blt, texte) + f, D NBO! 


Without loss of generality we may assume the final state to be the origin of 
the state space. Suppose that the Gram matrix W(t, t;) is nonsingular. Then 
W^! (to, t,) exists. If we choose u(t) to be uj(t) where 


u(t) = —B*(r)O*(t,, 0) W- (t. 6)x(6) (7-48) 
then x(t,) = 0, since by the use of this particular u(t) we obtain 
x(t) = D(t,, t))x(t9) 
= E D(t, r).B(G)B*G)e*(t, £)dt Wo (to, t)x(t) 


= (t, t)x(t) 
— e«t, tf Ë Dios BOBAD Gs Dd W^ s 1) (8) 
= e(t, to)x(to.) — e(t, to) W(to, t,)W- (ts, £,)X(to) 
= 0 
where we have used the relation 
e(t, t) = P(t, to) Plo, t) 


eee Se án 


Hence if the Gram matrix W(t), t,) is nonsingular, then any initial state x(f;) 
can be transferred to the origin of the state space during the time interval 
IE IS 
Conversely, suppose the system is completely state controllable. Then we 
can prove that the Gram matrix W(t,, :?,) is nonsingular. To prove this 
suppose W(t, t,) is singular. Then there exists some state x(t.) =+ 0 such 
that (x(t,), W(t,, 1) x(1,)) = 0. Define 
u,(t) = —B*(t)o*(t,, t)X(to) for t, Şt Xt, 


Then 
ty 
(x(t,), W(to, t,)x(t,)) = f | (x, GBB*O*x)dt 


= i (B*o*x, B*®*x)dt 
= fa), wat 


= f" afd — o (7-49) 


Since u,(t) is continuous in t, Eq. (7-49) implies that u,(t) = 0 for £, < t « f. 

On the other hand, if the system is completely state controllable, then 
there exists a control vector u,(r) for t, << ? < t, defined by Eq. (7-48) 
which satisfies the condition 

ti 
x(t) = — | ©, PBE u (r)dt 
Then 
l| xz) I? = Gt), x9) 


(x(t), — f^t, DBEa (dr) 


Il 


— f Cto), Dlt Bu ot 


- f} EOD x(t), wot 


j (u,(r), u(r))dt = 0 


This is a contradiction, since we assumed that x(f,) = 0. Hence, W(t, 1,) 
must be nonsingular. This completes the proof. 

Notice that the condition that the Gram matrix W(t, /,) is nonsingular is 
equivalent to the statement that 


B*(t)®*(t,, t)À 40 


on fy « t < t, for each n vector A ~ 0 in the state space V. This can be seen 
as follows: The condition that the Gram matrix W(f,, t,) be nonsingular 


e M MÁY —————— 


implies that the columns of B*(t) ®*(f, t) on t, «t <1, are linearly in- 
dependent. This is equivalent to stating that 


B*(t) *(t,, t) A, #0 
on f, « t « t, for each n vector A, Æ 0 in the state space. Since D(1,, £) is 
nonsingular, let us define 
A = [@*(t,, t]! A or A, = ®*(t,, t;)À 
Since 
S(t, t) = e(t, fo) P(t; t) 
we obtain 
B*(t)*(t,, t)À, = B*(1)O* (to, t)Ð* (t, to)A 
= B*(t)@*(t,, 1)A 
Thus the condition that the Gram matrix W(t,, ¢;) is nonsingular is equivalent 
to the statement that - 3 
B*(t) $*(,1) A #0 
ont, « t « t, for each A + 0 in the state space V. (See Prob. A-7-17.) 
We shall next present a theorem on complete output controllability of 
linear time-varying systems. 


Theorem 7-19 The linear time-varying system 


x = A(r)x + B(r)u 
y = C(t)x 
where x —n vector (state vector) 
u=r vector (control vector) 
y — m vector (output vector) (m <n) 
A(t) — n x n matrix, elements of which are piecewise 
continuous functions of time 
B(t) — n x r matrix, elements of which are piecewise 
continuous functions of time 
C(t) = m x n matrix, elements of which are piecewise 
continuous functions of time 


is completely output controllable on to « t « t, if and only if the Gram 
matrix V(t,, ¢,) is nonsingular, where 


Voto 1) = [^ Coe. DBEB, NCEA 
to 
and (tf, f,) is the unique fundamental matrix satisfying 


4 ex, t) = ADC, t), Dlia to) = I 
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Proof The proof is similar to that of Prob. A-7-17. We 
leave details to the reader. 

From Theorems 7-18 and 7-19 we can. immediately prove that complete 
state controllability on 1, < t < t, implies complete output controllability 
ont, « t « t, if and only if the m rows of C(?,) are linearly independent. 

We shall next consider complete observability of time-varying systems 
We shali define observability of time-varying systems as follows: A lise: 
varying system is said to be completely observable on the time interval 
t, <t<t, if for given t, and t, there is no initial state x(t,) 4 0 in the 
state space V such that y(t)=0 on to St<ty.The system is said to be 
completely observable if for any t, and some f, there is no initial state such 
that y(r) = 0 for all £ > t,. The system is said to be totally observable if 


Theorem 7-20(7-2) The System described by 
X = A(t)x 2E 
cens C (7-50) 
where X — n vector (state vector) 
y — m vector (output vector) (m « n) 
A(t)=n x n matrix, elements of which are piecewise 
continuous functions of time 


C(r)—mxn matrix, elements of which are piecewise 
continuous functions of time 


is completely observable on the time interval 1 f 
t 
Gram matrix o X t « t, if and only if the 


ty tn 
| M(t,, tj) = i D*(t, t))C*(r)C(r)axt, t,)dt (7-51) 
is nonsingular. Here @(t, ż,) is the unique fundamental matrix satisfying 
d 
dt ett, to) a A(t )et, to), e(t, to) = I 


F or complete b t 3 the fore oin ust h 0 
finite t > fo. 


Proof The solution of the first of Eq. (7-50) is given by 
X(t) = e(t, t)x(t,) 
Then y(t) is obtained as 
y(t) = C(r)x(t) = CDC, 1,)x(t,) (7-52) 


Suppose the Gram matrix M(t), t) i i 
" e G: oh) 1$ nonsingular. Then M-'(r,, t 
exists, Premultiplying Eq. (7-52) by ®*(r, t,)C*(t) and integrating. = 
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tı 
f, 9*«. (exo 
to 


xà |. D*(t, 1)C*(r) CQ), to)dt ] x(t) 
Hence x(f,) can be expressed as 
X(t.) = M™(tp, t) f. O*(t, 1.) C*(ty(t)dt (1-53) 


Every initial state x(¢,) can be determined from y(t) using Eq. (7-53). 
Conversely, suppose the system is completely observable but the Gram 
matrix M(t, ¢,) is singular. From Eq. (7-51) and Eq. (7-52) we obtain 


fiyvvGOXt = x", 186) 


Since by assumption the Gram matrix M(t,, ¢,) is singular, there exists a state 
x(t.) z 0 such that x*(£,)M(t,, t,)x(t)) = 0. For such x(t), y(t) z 0 for 
the time interval t, < ? < t,. This implies that x(t,) cannot be determined 
from y(t). This contradicts the assumption that the system is completely 
observable. Hence the theorem. 

Notice that the conditions for complete state controllability, complete 
output controllab:lity, and complete observability for time-invariant systems 
can be immediately derived from the corresponding conditions for time- 
varying systems (refer to Prob. A-7-18 and Prob. B-7-8). 

: In concluding this section, note that the concept of state controllability 
is essential in solving the regulator problem where we are concerned with 
transferring the state to the origin starting from a given initial point in the 
state space. If the system is not state controllable, then initial states induced 
by disturbances not in the controllable subspace cannot be controlled. This 
means that such a system may have long-term transients, or even instability. 
In certain cases control problems may be formulated in terms of the output 
y rather than the state x. In such cases it is essential to have output con- 
trollability, rather than state controllability. Finally, note that if the system 
is not completely observable, then not all states x(t.) can be determined from 
the output y(t), t > to- 


7-6 RELATION BETWEEN CONTROLLABILITY AND 
OBSERVABILITY 


In this section we shall discuss the relation between controllability and 
observability of dynamic systems. The analogies apparent between con- 
trollability and observability can be expressed by the principle of duality due 
to Kalman. 


Principle of duality (7-7) ^ Consider the system S, defined by 
x = A(t)x + B(t)u 
y = C(t)x 


where =n vector (state vector) 


x 
u=r vector (control vector) 
y — m vector (output vector) (m <n) 
A(t) — n X n matrix, elements of which are piecewise 
continuous functions of time 
B(t) — n x r matrix, elements of which are piecewise 
continuous functions of time 
C(t) = m X n matrix, elements of which are piecewise 
continuous functions of time 


and the dual system S, defined by 
z= —A*(t)z + C*(t)v 


n = B*(tz 0-59) 


where z= n vector (state vector) 
v = m vector (control vector) 
n=r vector (output vector) 
A*(t) = conjugate transpose of A(t) 
B*(r) = conjugate transpose of B(t) 
C*(t) = conjugate transpose of C(t) 
The system S, is obtained by 


(1) Replacing the free dynamic system by its adjoint system. 
(2) Interchanging input and output constraints. 


The principle of duality states that the system S, is completely state 
controllable (observable) if and only if the system S, is completely observable 
(state controllable). 


To verify this principle let us write down necessary and sufficient con- 
ditions for complete state controllability and complete observability of 
systems S, and S. For the system S, 


(1) A necessary and sufficient condition for complete state control- 
lability is that 


f Dlt, 1)B(t)B*(1)@*(t,, t)dt 


is nonsingular. 
(2) A necessary and sufficient condition for complete observability is that 


f " e»*(r, t) C* (0) CQ), to) dt 


is nonsingular. 


` 


(1-54) 


RUDI ICE ih amn a Dre e g a a ERU a TRU a AE RUE 
beer Ue ASSN ESS REAN AS 


For the system S,, A i 
(1) A necessary and sufficient condition for complete state control 


lability is that 
li (t, t)C*()C()V* (to, t)dt 
ty 


is nonsingular. T — 
(2) A necessary and sufficient condition for complete observability is that 


[js Y(t, t)BO)B* (tt, td! 


is nonsingular. 
Notice that since ®(r, to) and V(t, to) satisfy, respectively, 


4 91,1.) = ADB 1), Bl) =I 


System S, 


System S3 


Figure 7-4 System S, defined by Eq. (7-54) and its dual system S, defined by Eq. (7-55). 
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and 


Ute) = ANNY S), inh) X 


S(t,, t) and W(t, t) are related by 
P(t, t) = V*(t, to) 


E pcs the conditions, the truth of this principle is apparent. Figure 
shows the system S, defined by Eq. (7-54) and its dual syst 
Bhagi ystem S, defined 
. By the principle of duality the concept of observability may be dismissed 
by defining it as the dual of state controllability. The important point to note 
is that by the principle of duality a system constrained at the output can be 
converted into a system constrained at the input. If a system is completely 
state controllable (observable), then its dual system is completely observable 
vais "pae Hence the solution of the minimal time observation 
problem becomes identical with the solution of the mini i 
min 

problem for the dual system. CAO 

Note that the property of complete state controllability and the property 


of complete observability are Invariant under linear nons ular transfor- 
ing 1 


Impulse response function Consider à system described by 


X) _ (s+ 1) 
Us) (s+ Is+ 2643) (7-56) 


This system is not completely state controllable. Referring to Example 4-4 


the simplified stat i ; IE 
pa P e space representation of this system is given by Eq. (4-43), 


yd = 0 Oll», 0 
h|5] 9 —2 Olly tela lii 
Ds 0 0 —3j|ly, 1 


and x is given by x = y, + y, — i 

; Y:. The impulse response function / 
the system defined by Eq. 7-56 : : a2 ion A(t) of 
and y(0) = 0 y Eq. (7-56) is obtained as follows: Since for u(t) = &(t) 


ert. 30 0 0 0 
y¥)=e"B=|0 e* o |] 1] =| e-2 
0 0 gg l et 
the impulse response function A(t) is given by 
A(t) = e^*t — e-x 
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i PTT AMA 
U(s) ~ (s + 2s + 3) 


This system described by Eq. (7-57) is completely state controllable. Clearly, 
cancellation of the common terms in the transfer function defined by Eq. 
(7-56) results in the irreducible transfer function defined by Eq. (7-57). 
Note that the impulse response function does not represent the linear system 
completely, but represents only that part which is completely state control- 
lable and completely observable. 

Let us next consider realization of a system having the impulse response 
function A(t, 7). We wish to realize the system with least order among a 
number of linear systems which possess A(t, 7) as the impulse response 
function. 

Suppose we try to realize the impulse response function /(t,7) by an 
nth-order system of the form given by 


(7-57) 


X — A(t)x 4- B(t)u (7-58) 
y = C(t)x. 
where X — n vector (state vector) 
u = scalar (control signal) 
y = scalar (output signal) 


A(t)—n x n matrix 
B(t) =n x 1 matrix 
C(t) = 1 x n matrix 


The elements of A(t), B(t), and C(t) are assumed to be piecewise continuous 
functions of time. Note that impulse response function A(t, 7) can be realized 
by an nth-order system of the form given by Eq. (7-58) if and only if A(t, 7) 
is expressible as 
A(t, 7) = P()Qc) 

where P(t) and Q(7) are 1 x n matrix and n x 1 matrix, respectively, and the 
elements of P(t) and Q(t) are continuous functions of time ¢. To verify this 
statement note that ®(f, 7) can be written as 


e(t, 7) = (t, t)dx(t,, 7) 


!By means of a suitable coordinate transformation the linear dynamic system may be 
decomposed into the following four mutually exclusive subsystems: 

Subsystem A: controllable but not observable 

Subsystem B: controllable and observable 

Subsystem C: neither controllable nor observable 

Subsystem D: not controllable but observable 


For detailed discussions of canonical structure of linear dynamic systems see, for example, 
(7-6, 7-7). 


and 


h(t, 7) — 0 PG 
A(t, T) = C(t) B(t, 7) B(7) 
= C(t) @(t, t)) (t., 7) B(7), t2T 
= P()QG) 
where P(t) = C(t) W(t, tj) = 1 x n matrix 
Q(T) = B(t,, Tr) B(7) = n x 1 matrix 


If the linear system having A(t, 7) as the impulse response function does 
not include a lower-order subsystem which possesses the same impulse 
response function, then the system is of least order. 

Realization of h(t, 7) is irreducible (if the system is time-invariant, the 
system transfer function has no cancellation) if and only if the realized 
system is completely state controllable and completely observable. In other 
words realization of A(t, 7) by a completely state controllable and completely 
observable system yields least-order z. 


Concluding comments In concluding this chapter we list 
reasons why concepts of controllability and observability are important in 
control theory. 


1. Concepts of controllability and observability give a complete solution 
to the optimal regulator problem. The linear independence conditions 
for controllability and observability are of fundamental importance in 
the analysis and synthesis of multivariable control systems. Famil- 
iarity with concepts of controllability and observability enables one to 
formulate meaningful control problems and to make the necessary 
assumptions, if any, at the outset of the analysis. 

2. Controllability and observability are dual. Therefore, Wiener's 
filtering problem can be considered as dual to the optimal regulator 
problem. 

3. Concepts of controllability and observability clarify the relationship 
between the linear dynamic system and the impulse response function 
(or the transfer function). 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-7-1 Consider the system 


(m) (m-1) 
Xt ax tees an 1X +a,x=uU (7-59) 


re e p a a: 
ee - 


and the adjoint system 
n (n-1) = É " 
(yp + (Hayy eR Dae aycu  — (7-60) 


where the bar denotes the complex conjugate. Let A(t) and A(t) be the impulse 
response functions of the systems described by Eq. (7-59) and Eq. (7-60), respec- 
tively. Show that 
h(t) = —h(—t) 
Solution Taking the Laplace transform of Eq. (7-60) with 


(n-1) (2-2) 


»(0) = x0) =... = X0) = x0) —0 
the result is 
[((—sy* + àá(—sy7! + +++ + à. iC) + Gr] Y(s) = U(s) (7-61) 
The impulse response function h,(7) is then obtained from Eq. (7-61) as 
1 
= a CAE F R e E S y ay a E F |? 1 0 
h(t) = Z ‘oa Fals c caos) z] (1-62) 
h(t) — 0 t<0 


The Laplace transform of Eq. (7-59) with zero initial condition is 

(s? + ays"! + +++ asas + as) X (s) = U(S) 
The impulse response function A,(f) is then equal to the inverse Laplace transform 
of X(s) with U(s) = 1. Thus 


1 
My enn EET EE CENE 


ht). s. 1<0 


The complex conjugate of /,(r); namely, À,(t), is 


t>0 


í - : 0 (76 
Ha-fiacmnerR——E.px Ue UW 


Thus comparing Eq. (7-62) with Eq. (7-63) and noting that 


e; + a] —— 


and 


- — S = at 
= Fe EN zl e 
we obtain 
h(t) = —h(—t) 
Problem A-7-2 Consider a system described by 
| X=A(t)X (7-64) 


and a system described by 
Y = —YA(t) (7-65) 
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Show that if Y is a nonsingular solution of Eq. (7-65), then Y-!C where C is a 
constant matrix is a solution of Eq. (7-64). 


Solution From Eq. (7-64) and Eq. (7-65) 


YX + YX =0 (7-66) 
Hence, 
XX =C 
where C is a constant matrix. Since Y is nonsingular, Y-! exists and 
X = Y-!C (7-67) 
Clearly, Eq. (7-67) satisfies Eq. (7-66), or 
YX = YA()X 
which is equivalent to 
X = A(0X 


Thus, Y~'C is a solution of Eq. (7-64). 


Problem A-7-3 Prove that if n vectors Xs ss 


-> X, are orthogonal 
to each other, then these vectors are linearly independent. 


Solution Let us compute the Gram determinant. 


(Xi, X). (x, Xj) - (X, Xn) 


(Xo, X) (Xa, Xj) -*** (Xa, Xn) 
l(x;, xj)l = $ E . 


(Xs. Xj) (X4, X) ce (Xn, X) 


(X, Xi) 0 LL 0 
0 (Xs Xj) res 0 


0 So pe? Bax 
= (Xi, Xi) (Xs Xe) +++ (x, Xn) 0 
By Theorem 7-2, » vectors Xi X2, . . . , X, are linearly independent. 
Problem A-7-4 Consider the system 
y = Ax 
where x and y are n vectors and A isan X n matrix. Prove that 
(0. y) = A((xi, x3)A' 
where À is the complex conjugate of A. Prove also that 


[679 yj)l-2 IAT (x, x)| 
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Solution Notice that 


A(X, xA’ 
dj ctt Ön Qux) 8° (XXa) ] fan c7 am 
äni TN Õnn (Xs; x) oic (Xn; X4) Qn ctt Ann 
Qa ct Ay (XoY) c (XX 
Ön ctt Öm (CaP *** Fa) 
Qu) 5 Qu») 

= 3 : = (( yj) 


Oh ») rare (n Yn) 


Taking determinants we obtain 
IG y) = [AI] Cis yA] = [AP 1s x] 


Problem A-7-5 (Schmidt Orthonormalization) Suppose that vectors 

Xj, Xo, .. -Xp are a set of basis vectors of a state space V. Prove that it is always 
possible to find a set of orthogonal vectors yi, ys... Ya asa basis of V. 

Notice that this problem can be reduced to that of proving the existence of the 


coefficients Cii, C21, C22, ... , Cnn Such that 


yi = CX, 
Yo = CyiX1 + CooXq 


Ya = CyiX, + CaaXq + °°* + CanXn 


are orthonormal or (yj, y;) = 3j. 


Solution The following method for converting a given set of vectors 
Xi, Xo, ..., X, into an orthonormal set of vectors yj, yo, . . ->Yn is called the Schmidt 
orthonormalization method. In this method we begin with putting xX, =z, and 
normalizing the vector z, by dividing it by its norm ||z, || and setting y; = z,/|| z, ||. 
We then define a vector z;, orthogonal to y, by setting z; = x» — (Xs, yi) yi. Then 
we normalize z, by setting ys = zs/||z; ||. Next we define z, orthogonal to y; and y; 
by setting z; = x; — (Xz, yi)y: — (Xs, ys) ys and then set y, = 2,/|| z; |. The process 
is continued until we obtain z, and y,. Rewritten zi, Zs, ..., Zn and yi Ys... Yn: 


Z 
a= INPS 
Z 
zn—x-(X&yY Ye = TET (7-68) 


n=l Se 
Zn = Xp — ZG Y3Ye y= [Za Il 


Notice that || z, || = 0, since the x, are linearly independent. The inner product of 
yi and Yi is 


(Y. y) = (ri =) = (G) - 


Hence the y; are normal. 
Notice that 


(Ys, Y) = Tal y) 
= [zi 6e 9 7 6 06.92] - o 
(s) = rans = (Xs, ¥)¥1 — (Xs, Y2)Y2); y.) 
= Tal [6s Yi) — Gs Y (Y Y1) — (xs, Yo)(Yo, »)] =0 


On yd = pag [Gs 92 — a Dny) — (3 Gav] = 0 


Similarly, we can show that Ym+1 is orthogonal to y,, yo,..., Ym (n <n — 1), since 


m 
(Zm+is Y3) = (Xm+1 — 2 (Xni, Ye) Yrs yj) 


= (Xnii, yj) — poe Yi) (Ves yj) 


= (Xs, yj) —( 1; yj) =0 a «Jj « m) 


Therefore, the basis (yj, ys, .. . , Yn} is orthonormal, or 


(Ys yj) = 8j (,j—1,2,...,n) 
Equations (7-68) can be rewritten as follows: 

yi = CX) 

Y2 = CaX, F CooXo 

Yn = CuXi + CasXo + +e + CanXn 
The constants Cii, c4, - - © , c44 are determined from Eq. (7-68). 


Problem A-7-6 Prove that Gram determinant |(x;, x] is non- 
negative. j 
Solution From Theorem 7-2, we have | (x, xj) | = 0 if thex, (; = 1, 
2,...,”) are linearly dependent. If the x; are linearly independent, then there exist 
constants c,; such that 
yi = CX, 
Yo = €nX; + CopXo 
Ys = €31X1 + C32X2 + C33X3 


Yn = CuXQ + CXe + +++ + C44X, 


EY 


are orthonormal. Then from Prob. A-7-4, we obtain 
IG x ICP = (ya y2| 2185] = 1 


where 
Cy O0 0 
Cor Cop c 0 
|C|= . . s = Cila tt* Cag 


Cui Cag *°** Cnn 


Therefore, we obtain 


L 1 1 à 5 A 
|(x,x9)|— ZE ak >0 (if the x, are linearly independent) 
| (x; x)|] = 0 (if the x, are linearly dependent.) 


|(x;, x;)| is thus nonnegative. 


Problem A-7-7 -Prove that any positive definite matrices and positive 
semidefinite matrices can be expressible as Gram matrices. 


Solution Notice first that a Hermitian form x*Bx can be reduced 
to the diagonal Hermitian form by means of a suitable unitary transformation 
x = Uy, namely, 

x*Bx = y*U*BUy = X jiyi + Jays + +e + AnInVn 
where A, A», ... , Am are eigenvalues of B. If B is positive definite then X,, ,..., 
Àn are all positive real, If we choose 


WM 9 
T=U AX 


then the unitary transformation x — Tz will transform x*Bx into the following 
form: 
x*Bx = z*T*BTz = 2,2, + ZZ + 00° + ZnZn 
Hence, if B is positive definite, then there exists a nonsingular matrix T such that 
T*BT =I or B = (T*9)-!T-! = (T-)*T-! = A*A 
where A=T" 


Any positive definite matrix can thus be represented as a Gram matrix A*A. 
Similarly, if B is positive semidefinite and has rank m, then there exists a non- 


singular matrix T such that 
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where L, is the m x m identity matrix. Hence 


B = (T*)-: hn T~ 
0:0 


L/0]'[L,!0 
=((T-1)* | E - 
ae] eje] 
= A*A 
where 


Thus any positive semidefinite matrix can be represented as a Gram matrix A*A 


Problem A-7-8 Suppose that G is the Gram matri 
, ! trix of an n xn 
matrix A. If A is of rank m (m < n), then prove that G is also of rank m. 


Solution Since G is the Gram matrix of A, we have 


G = A*A, 
In Prob. A-7-7 we showed that 


where I, is the m x m identity matrix and T isan n x n nonsingular matrix. Hence 
I I 
G = (Ts In19 | p, [Bo 0 
0:0 


where B,, is an m x m nonsingular matrix. The rank of G is therefore mm. 


Problem A-7-9 Consider a discrete-time system with a single control 
signal. The value of the control signal can take three values Tu, 0, and —u,. How 
many initial states can be transferred to the origin in n sampling periods? 


NJ Solution There are three possible values for u. Hence 3" different 
combinations of u(Q), u(1), -..,U(n) exist. Hence at most 3” initial states can be 
transferred to the origin in n sampling periods. 


Problem A-7-10 Given the state Space equation 
X — Ax 4- Bu 


X — n vector (state vector) 

u = y vector (control vector) 
A =n X n constant matrix 

B — n x r constant matrix 


find the condition on A and B such that all modes of oscillation can be excited. 


Solution Derivation of the condition i 
on A and B for this problem 
can be accomplished using the derivation for complete state controllability. Suppose 


where 
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that the system is oscillating in all modes. Then suppose that at some finite time 
t, it is required that all modes of oscillation have zero amplitude. Damping of 
the oscillations is accomplished by applying the control vector u. Then to reduce 
all oscillations to zero at time 7; it is necessary that x(r,) be zero. It is obvious that 
all modes of oscillation must be excitable to do this. The problem can then be 
viewed as that of complete state controllability. Hence, the condition that all modes 
of oscillation can be excited is that n columns out of nr columns of 


B, AB, AB, -AIB 


be linearly independent. 

Problem A-7-11 An important thing to remember in discretizing 
continuous-time systems is that the introduction of sampling may impair control- 
lability. A single-input-single-output system (continuous-time or discrete-time) is 
completely state controllable if and only if no cancellation of poles is possible in 
the transfer function. Notice that the cancellation may take place in passing from 
the continuous-time to the discrete-time case. A system which is completely state 
controllable in the absence of sampling remains completely state controllable after 
the introduction of sampling if and only if, for every eigenvalue of the character- 
istic equation, 


Re i = Re Xj 
implies 
where pes d3bA.. 


T — sampling period 
(Unless the system contains complex poles, the cancellation will not occur in passing 
from the continuous-time to the discrete-time case.) Verify the foregoing statement 
by using the following systems: 
= sta 
(1) G(s) T (s + a)? + b? 


3 
(2) € = Gr x6 43) 


Solution 


(1) The z transform of 
s+a 


CO 7 Gr FE 


is given by 
|. €"Tz(e?7z — cos bT) 
X IG] = zur: FEF cos bT)z FI 


The two roots of the characteristic equation are X, = —a + jb and A, = —a — jb. 
If Im[A, — à] = 2b = 2rz/T or bT = rz (where r = integer) the factor (e*7z — 
cos rz) in both the numerator and the denominator will cancel. Hence for com- 
plete state controllability, we must have T + (rz/b) (where r = +1, 2-2,...). 


(2) The z transform of 


_ 3 
66 = rrr 
is given by 
ZIG] = zsinT a zsin2T 
2?—2zcosT+1  2(2? —2zc0s2T + 1) 
The four roots of the characteristic equation are 4, =j, Me = —j, A, = 2j and 
^, = —2j. If the sampling period is T = 2z/3 we find that 
Pie 3 3z 
tI — 30:02 T) 
and if T = 2x 
xIG] =0 


Hence we see that with these sampling periods the system is not completely state 
controllable. For complete state controllability, T+ 2rz/3, T + rz|2 (r = +1 
Ce 

Notice that we can always avoid the loss of controllability by choosing the 


sampling period sufficiently small compared with the smallest time constant of the 
system. 


Problem A-7-12 Consider the control system shown in Fig. 7-5. 
The controller is a nonlinear controller, Assuming that there is no input signal 
and the system is subjected to only the initial condition, determine the region 
of controllability for the following two cases: 


Case 1: The control signal can assume only two values, u.— +1. 


Case 2: The control signal can assume any value between 1 and —1, or 
=b<w< l. 


| 
(s+2) Cs-1) 


Controller 


Figure 7-5 Control system. 


Solution The system equation is given by 


X+xX-—2x =u (7-69) 
Case 1: For u = 1 this equation becomes 
XT-xXx—2x-21 


from which the equilibrium point is 


x= —}, x=0 


Since the characteristic equation 
M Rx -220 


has roots X = 1 and X = —2, the equilibrium point (— 3, 0) is a saddle point. The 
separatrices which pass through the saddle point have slopes of —2 and 1. Figure 
7-6(A) is a state plane diagram showing the saddle point, separatrices, and several 
trajectories. 


(A) (B) 


Figure 7-6 State plane diagrams for.the system of Fig. 7-5. (A) Diagram showing 
saddle point, separatrices, and several trajectories; (B) diagram showing domain of 
controllability. 


For u = — 1, Eq. (7-69) becomes 
X^Lx—2x—2-—1 
The equilibrium point is now 
x=}, X20 


It is also a saddle point. The state plane diagram is identical to the foregoing, 
except that it is shifted to the right by 1. Figure 7-6(B) is the superposition of 
the two sets of diagrams corresponding to 4 = 1 and u = — 1. 

Graphically we see that the origin can be reached from any point between 
separatrices I and II shown in Fig. 7-6(B). To reach the origin with u = +1 it is 
necessary that the trajectory starting from a given initial point intersects the curve 
S. (The curve S is the switching curve consisting of the trajectory with u = —1 in 
the upper half plane and that with x = 1 in the lower half plane both passing 
through the origin.) To reach the curve S from points lying between separatrices 
I and II but to the left of S it is necessary to use u = 1 until the curve S is reached. 
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Similarly, for the points lying between separatrices I and II but to the right of the 
curve S, it is necessary to use u = — 1 until the curve S is reached. 

Graphically, it is easily seen that it is not possible to enter the region bounded 
by separatrices I and II from the exterior. Hence the region bounded by two 
separatrices I and II is the domain of controllability for u = + 1. 


Case 2: The domain of controllability in this case can be determined similarly 
to Case 1. For u = constant and bounded by +1 the saddle points are 


(-l<gu<l) 


1 da 
x= —3u, ses 


For —1 < u «1 (wis bounded but not necessarily constant), the maximum width 
of the domain of controllability is obtained when | u | is maximum. Hence for Case 2 
the domain of controllability is the same as Case 1 and is bounded by separatrices I 
and II shown in Fig. 7-6(B). 


Problem A-7-13 Show that the matrix equation AX = B, where A, B, 
and X are n X n constant matrices, has a unique solution if and only if 
rank A = rank [A! B] = n 
Solution The given matrix equation is clearly equivalent to the 
following equation: k 


[AX; AX; | +++ | AX,] = [Bi i B3; +++ | By] 


where 
[Xii Xo i iXX 
[B; ! B; | +++ i B,] = B 
or 
AX, = B, 
AX, = B, 
AX, =B, 


By Theorem 7-12 these z equations, respectively, have unique solutions if and only 
if 

rank A = rank [A! Bi] = n 

rank A = rank [Ai B;] = n 


rank A = rank [A | B,] = n 
Clearly B,, Bo, . . . , B, are linearly dependent on n columns of A. Hence the fore- 
going condition is the same as 

rank A = rank [A i B] = n 


Thus we have shown that the original matrix equation has a unique solution if and 
only if 


rank A = rank [A i B] = n 
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Problem A-7-14 Consider the following system: 


Xl —] -2 =2 x 2 
x*X,|2| 0-1 1ļ||x|+|0 jE 
X;j 1 0 —1 Xs 1 

x; 
y=[l 1 0]|»x 

Xs 


Show that the system is completely state controllable and completely observable. 
Solution For complete state controllability, the vectors B, AB, and 
A?B must be linearly independent, where 


2 —4 0 
B=| 01, AB — T Is AB = 0 
1 E EE 
Since 
2 —4 0 
0 1 0 |= =10 0 
1 1 —5 


it can be seen that the vectors B, AB, and A?B are linearly independent. Thus, the 


system is completely state controllable. 
For complete observability the vectors C*, A*C*, and (A*)*C* must be linearly 


independent, where 


1 —1 0 
C*-|1|, A*C*-|-3|, (A*'C*-|5 
0 —1 0 
Since 
1 —1 0 
1 —3 5|/=5+40 
0 —1 0 


The vectors C*, A*C*, and (A*)*C* are linearly independent. Thus the system is 
completely observable. 
Problem A-7-15 Consider a system defined by 


x(k + 1)T) = GI[x(kT) + C*u(kT)] 
y(kT) = Cx(kT) 


where 


C=[1 0 0 0 


= © Oo 0 
ao OQ Q m 
oo m Qo 
or oOo 


Given any unknown initial state x(0), every state vector is reduced to zero at most 
4 sampling periods if and only if the closed loop system is governed by 


x(k + 1)T) = G[x(kT) — C*(C*, x(kT))] 

Solution The given system is completely state controllable, since 
column vectors C*, GC*, G?C*, and G?C* are linearly independent. By Theorem 
7-5, the fastest control scheme is uniquely determined by 

u(kt) = — (9, x(kT)) 
where 7, is the first vector in the dual basis Nis 22s. ..., 72, OF the basis &,, 
Es ..., E, where E, = G-‘*'C*. For this system 7, is found to be C*. Hence 


i(kT) = —(C*, x(kT)) 
which is the negative of the output MKT). The following scheme: 
x((k + 1)T) = G[x(kT) — C*(C*, x(kT))] 
will reduce every state vector to zero at most 4 sampling periods. 
Problem A-7-16 Consider the system 
X 2. 0 0] x 
X,|2|0 2 " Xo 


X; 0 3 Lites 
The output is given by 
Xi 
y=[1 ! 1]| x 
Xs 


(1) Show that the system is not completely observable. 
(2) Show also that if the output is given by 


p 1 1i 11% 
= Xo 
X» Y 2 Xll 


the system is completely observable. 


Solution 
(1) Let us define 


2 0 0 
A=|0 2 0|, C=[i 1 1] 
0 3 1 
Then 
[C* | A* C* i (A*)?C*] 
1 2. 4 
—|1 5 13 
A, d 1 


Since the rank of this matrix is two, the system is not completely observable. (Notice 


—  — ——— 


that the degree of the minimal polynomial of A is two. Hence (A*)*C* is linearly 
dependent on C* and A*C*.) 
(2) Let us define 


pup 
c=]; 3.3 
Then 
[C* | A* C* i(A** C*] 
1 1i2 214 4 
=|1 215 13 | 13 35 
T 3i 311 3 
Clearly 
1.3 2 
12 5]#0 
13-1 


Hence the rank of this 3 x 6 matrix is three. The system is, therefore, completely 
observable. 
Problem A-7-17 (7-2) Consider the linear time-varying system 
x = A(t)x + B(r)u (7-70) 
where X — n vector (state vector) 
u — r vector (control vector) 
A(t) — n X n matrix, elements of which are piecewise 
continuous functions of time 
B(t) = n x r matrix, elements of which are piecewise 
continuous functions of time 


Prove that the system is completely state controllable on f, « 1 < t, if and only if 
B*(N@*(t,, DA z 0 
on f, « t < t, for each A z& 0 in the state space V, where (rf, tọ) is the unique 
fundamental matrix satisfying 
F(t, fo) = ABE, to), 


and À is a fixed length nonzero vector (pointing in every direction in the state space 
V). 


e(t, to) =I 


Solution If the system is completely state controllable then any 
initial state can be transferred to any final state in a finite time interval fp < 1 « t. 
We shall first convert the problem of transferring to x(t,) from some initial state 
x(t.) to. that of transferring to x%(t,) from the origin of the state space V. The 
solution of Eq. (7-70) is given by 


x(t) = ®t, ta) x(t) + f D(t, 7)B(r)u(r) dr 


Define 
x4(r) = x(t) — dt, t))x(19) 
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Then, 
x*(t) = f ' ext, 7) B(7) u(r) dz (7-71) 


The problem of transferring to x(t,) from some initial state x(r,) is thus converted 
to that of transferring to x^(t;) from the origin of the state space V. 

Notice that the system is completely state controllable on t, « t « t, if and 
only if every point x%(t,) in the state space can be reached from the origin, or in 
other words, if the region I(t), t;) of all the points x4(t;) in Eq. (7-71) which can 
be reached from the origin coincides with the whole state space V. Notice that 
IYt,, tı) is convex. Hence, in order to show that I'(t,, t;) coincides with the whole 
state space V, we need show only that [(f,, ¢;) is not bounded in any direction in 
the state space. Let us take the inner product of x^(f;) with every fixed-length 
nonzero vector A (pointing in every direction in the state space V) and show that 
it can be made as large as desired by some choice of u. 


t 
(A x(n) = S? A, (t, But) dt (7-72) 
If u(r).is chosen such that 
u(t) = KB*(r)eo*(n, OA 
where k is positive, then Eq. (7-72) becomes 
(A, x*()) = k [BOO DA, BNO, DA) dr (1-73) 
A sufficient condition for complete state controllability on t, « t « f; is that 
B*(t)o*(t, t)À 40 (7-74) 
on tj « t < t; for each A = 0 in the state space V, because then the integral in 
Eq. (7-73) is positive for every A = 0, and (A, x%(t,)) can be made as large as 
desired by increasing k. The condition (7-74) is also necessary because if it is 
violated by some A = A,, then no matter how u(f) is chosen in Eq. (7-72), (A,, 


x%(t,)) = 0 and no point x^(/;) in the direction of A, can be reached. This com- 
pletes the proof. 


Problem A-7-18 Consider the system defined by 
x = Ax + Bu 


where X — n vector (state vector) 
u — r vector (control vector) 
A =n X n constant matrix 
B =n X r constant matrix 


A necessary and sufficient condition for complete state controllability of this system 
can be stated thus: the Gram matrix given by Eq. (7-47) or 
t 
WOO, 5) = [" DO, ) BB*@*(0, £) dt 


where l @(0, t) = e-^ 


is nonsingular. Prove that this condition is equivalent to the statement that the 
composite n X nr matrix P where 
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P = [Bi AB | --- | A""!B] 


is of rank n. 
Solution The Gram matrix 
WO, 5) = [s (0, t) BB* p*(0, 1) dt 


is nonsingular implies that the n columns of B*®*(0, t) are linearly independent, 
or n rows of ®(0, 1)B are linearly independent. Notice that in the present problem 


A is a constant matrix. Hence 
p-1 j 
(0, 1)B = e-^'B = 2 a(t) AtB 
= 
where p is the degree of the minimal polynomial of A and a(t), e(t), . . . , a xt) 
are linearly independent. Since 
a(t) 
a(t) 


(0, :)B = [Bi ABi - - - | A?-1B] 


&p-(t) 
linear independence of the n rows of ®(0, ?)B implies that the rank of 
[B! AB | --- ! A77! B], pin 
is 1 which is equivalent to the statement that the rank of 
[B| AB; --- ; A*'!B] 


is n. 


PROBLEMS 


, X, are linearly independent 
«AK, are 


Problem B-7-1 Suppose that Xi, Xs, ... 
column vectors and A is a nonsingular matrix. Show that Ax;, AX», . . 
also linearly independent. 
Problem B-7-2 Consider a linear time-varying system 
x = A(x + B(r)u 
where x — n vector 
u — r vector 
A(t) = n x n matrix, elements of which are piecewise 
continuous functions of time 
B(t} = n x r matrix, elements of which are piecewise 
continuous functions of time 


Show that the solution of this equation can be given by 


x(t) = Y(t, t) [xtt) $ f : Y, f,)B(z) u(r) d7 | 


2 


where Y(f, £.) is the solution of 
EYU t) = -YE DAU), Wint) =I 


Problem B-7-3 Consider the system 


fo E 
a 


Show that the output y is independent of the control function u. 
Problem B-7-4 Suppose that a system described by 


X = Ax + Bu 
where X — n vector 
u — scalar 


A =n X n matrix 
B — n x 1 matrix 


is completely state controllable. Prove that the system described by the following 
equation is also completely state controllable. 


y = K"!'AKy + K^!Bui 
where K is an arbitrary n X n constant matrix which is nonsingular. 


Problem B-7-5 Consider a system described by the following scalar 
differential equation: 
(n) (n-0) m) (m-1) 
xax +. dotum ut eile a Toc bu, (m « n) 


n-1) 
Find the condition that for any nonzero initial conditions (0), MO) ye wary PO 
we have x(t > t,) = 0 for some 1, > 0. 
Problem B-7-6 Consider the system described by 
X = A(t)x + B(r)u 
y = C(t)x + Diu 


where x=n vector (state vector) 
u =r vector (control vector) 
y =m vector (output vector) (m <n) 


A(t) =n X n matrix 
B(t) = n x r matrix 
C(t) = m x n matrix 
D(t) = m x r matrix 


and the elements of A(t), B(t), C(r), and D(t) are piecewise continuous functions of 
time. The output can be given by 


Y(t) = Ce, &)x() + [^ H C, vu) dr + Dul) 


where H,(t, 7) = C(t) @(t, 7) B(7) 
and @(t, f,) satisfies the equation 
SOUL, t) = AOU, i) (5) =I 


Find the condition that the system is completely output controllable on tp < t < 1. 
Write the condition in terms of an appropriate Gram matrix involving H,(t, 7) 
and D(r). 


Problem B-7-7 (7-8) Consider the system 
x = A(t)x + B()u (7-75) 
where X = n vector (state vector) 
u = scalar (control signal) 
A(t) =n X n matrix, elements of which are piecewise 
continuous functions of time 
B(r) =n x 1 matrix, elements of which are piecewise 
continuous functions of time 


The solution of Eq. (7-75) is 
x(t) = DE, to) x(t) + f^, t, 7)BG)ut) d7 


= W(t, ty) x(t.) + f ; h(t, 7)u(r)dv 


where h(r, 7) is the response vector of the system to an impulse u(r) = 8(t — 7) 
applied at ż = 7. Assume that the functions /,(t,7) are linearly independent on all 
time intervals, t, < 7 < t. Namely, for every nonzero n vector A the inner product 
of A and h(t, 7) is nonzero, or 


(A;h(z; T) + 0 on ob<r<t 


Prove that this condition is equivalent to the statement that the system can be 
brought by a suitable control signal u from any initial state at ¢ = f, to any final 
State x(t,) at t = t, on an arbitrary finite time interval, tọ << t « ft). 


Problem B-7-8 Consider the system defined by 


x = Ax 
y —Cx 
where x — n vector (state vector) 


y =m vector (output vector) 
A =n X n constant matrix 
C = m X n constant matrix 


A necessary and sufficient condition for complete observability of this system can 
be stated that the following Gram matrix be nonsingular: 


M(0, t) = j3 D*(t, 0)C* C@(t, 0) d 
where (r, 0) = e^t 


436 CONTROLLABILITY AND OBSERVABILITY 


Prove that this condition is equivalent to the statement that the composite n X mn 
matrix P where 


P = [C* |A*C* į -+ (A*)"1C*] 
is of rank n. 
Problem B-7-9 Consider the system 
X = A(t)x + B(r)u 
y = C(:)x 


Prove that if z(r) = 0 (for t > to) is the only solution to the following two simul- 
taneous equations: 


(7-76) 


Z = —A*(t)z, B*(t)z 2 0 
then the system defined by Eq. (7-76) is completely state controllable. 
Prove also that if x(t) = 0 (for t > to) is the only solution to the following two 
simultaneous equations: 
x = A(t)x, C(t)x = 0 
then the system defined by Eq. (7-76) is completely observable. 
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Stability Analysis 


8-1 INTRODUCTION (8-/, 8-2, 8-3)! 


The degree of complexity of stability analysis of dynamic systems increases 
rapidly as the description of system differential equations is modified from 
linear time-invariant equations to linear time-varying equations and to non- 
linear equations. A variety of stablility criteria is applicable to linear time- 
invariant systems. Only several methods are applicable to stability analysis 
of linear time-varying systems and nonlinear systems. Most of these are 
quite limited in their applications. hes 

This chapter presents Routh-Hurwitz stability criteria which are ap- 
plicable to linear time-invariant systems and the Liapunov methods of 
stability analysis (the first method of Liapunov and the second method of 
Liapunov) which are applicable to both linear and nonlinear systems. We 
shall devote much attention to the second method of Liapunov, which 
provides stability information on linear and nonlinear differential equations 
without solving them; hence Liapunov’s second method is called the direct 
method of Liapunov. Because, in general, explicit solutions of nonlinear 
differential equations are wholly out of the question, the second method 
of Liapunov is most useful for investigating the stability of nonlinear systems. 
This method gives sufficient conditions for asymptotic stability of equili- 
brium states of nonlinear systems and gives necessary and sufficient conditions 
for asymptotic stability of equilibrium states of linear time-invariant systems. 


lFigures in parentheses refer to material listed at the end of this chapter. 
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In addition to stability analysis the second method of Liapunov is quite 
useful in designing optimal control systems that are stable. 

In the following portions of this section we give definitions of several 
terminologies necessary for stability analysis of dynamic systems. 


System The system we consider in this chapter is defined by 


X fx) (8-1) 
where x is a state vector (n vector) and f(x, 1) is an n vector whose elements are 
functions of x;, X», ..., x, and t. We shall assume uniqueness of solutions and 
a continuous dependence of solutions on initial conditions. We shall denote 


the solution of Eq. (8-1) as h(t; Xo, to), where x = x, at t = t, and t is the 
observed time. Thus, 


(to; Xo, fo) = Xo 


Motion In defining stability, asymptotic stability, and insta- 
bility we shall frequently use the word "motion." A motion is defined as a 
trajectory starting from any state or any point in the n-dimensional state 
space. 


Equilibrium states In the system of Eq. (8-1) a state x, where 


f(x. f) for all t (8-2) 
is called an equilibrium state of the system. If the system is linear time-invari- 


ant, namely f(x, £) = Ax, then there exists only one equilibrium state if A is 
nonsingular, and there exist infinitely many equilibrium states if A is sin- 


gular. For nonlinear systems there may be one or more than one equilibrium . 


state. The equilibrium states correspond to the constant solutions of the sys- 
tem. (x = x, for all £.) The determination of the equilibrium states does not 
involve the solution of the system's differential equation, Eq. (8-1), but only 
the solution of Eq. (8-2). 

If equilibrium states are isolated from each other, they are called isolated 
equilibrium states. Any isolated equilibrium state can be shifted to the origin 
of the coordinates, or f(0, t) = 0, by a translation of coordinates. In this book 
we shall treat stability analysis of only isolated equilibrium states. Therefore, 
in this book an equilibrium state x, should be interpreted as an isolated 
equilibrium state. (Most of the existing stability theorems are concerned with 
stability or asymptotic stability or instability of isolated equilibrium states.) 


Stability An equilibrium state x, of the system of Eq. (8-1) 
is said to be stable if for each real number e > 0 there is a real number 
&(e, to) > O such that the inequality 


[xo — xl < 8 


implies 


lldt; Xo, to) — Xel] < € for all t Z to 


The real number 8 depends on e and, in general, also depends on ʻo. If è 
does not depend on ża the equilibrium state is said to be uniformly stable. 
Figure 8-1 shows a stable equilibrium state x, of a second order system and 
the representative trajectory starting from x,. In Fig. 8-1 S(e) and S(8) are 


5(8) 


avr. 
4 S(e) 


$ (ixo) He) 
Figure 8-1 Stable equilibrium state x, and representative trajectory starting from Xp. 


respectively circular regions of radii e > 0 and 8 > 0 about the equilibrium 
state x,. [In the case of an n-dimensional state space any closed, bounded 
surfaces can be used in place of circles to define regions S(e) and S(8).] S(e) 
consists of the states x satisfying ||x — x,||< e. This figure shows that 
corresponding to each S(e) there is an S(8) such that a motion starting at a 
state x, in S(8) does not leave S(e), or x, in S(8) implies P(t; Xo, to) is in S(e) 
for all £ > fe. That is, a motion starting in S(5) never reaches the boundary 
circle H(e) of S(e). 

The solutions of Eq. (8-1) are said to be bounded if there exists for a given 
8 > 0 a constant e(8, ta) such that 


xo — xell < 9 
implies 
ld(t; xo, to) — Xel| < €, to) 
If e does not depend on że, the solutions are said to be uniformly bounded. 


for all t > to 


Asymptotic stability An equilibrium state x, of the system 
given by Eq. (8-1) is said to be asymptotically stable if it is stable and if 
every solution starting at a state x, sufficiently near x, converges to x, as 1 
increases indefinitely. Namely, given two real numbers e > 0 and u > 0 
there are real numbers 8 > 0 and T(x, 8, fo) such that 


|Xo — Xell < 6 
implies 
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| (zt; xo, to) — Xell Ce for all t > to 


and 
ll(t; Xos to) — Xell Ku forall t >t) + T(p, 8, to) 


Figure 8-2(A) shows an asymptotically stable equilibrium state x, of a 
second order system and the representative trajectory starting from x,. Figure 
8-2(B)isa plot of the representative trajectory as time elapses. In these figures, 
S(e), S(8), and S(u) are, respectively, circular regions of radii e > 0, 8 > 0, 
and w > 0 about the equilibrium state x.. If the equilibrium state x, is 
asymptotically stable, then every motion starting at a state x, in S(8) con- 
verges, without leaving S(e), to the origin as time increases indefinitely. 

In practice, asymptotic stability is more important than mere stability. 
Since asymptotic stability is a local concept, merely to have established 
asymptotic stability may not mean that the system will operate properly. 
Some knowledge of the size of the largest region of asymptotic stability is 
usually necessary. Such largest region of asymptotic stability is called the 


S(8) 
(A) S(e) 


thread HG) 


(B) 


Figure 8-2 (A) Asymptotically stable equilibrium state x, and representative trajectory 
starting from xg; (B) plot of representative trajectory as time elapses. 
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domain of attraction. It is that part of the state space in which asymptotically 
stable motions originate. In other words, every motion originating in the 
domain of attraction is asymptotically stable. 


Asymptotic stability in the large If asymptotic stability holds 
for all states (all points in the state space) from which motions originate, the 
equilibrium state is said to be asymptotically stable in the large. Namely, the 
equilibrium state x, of the system given by Eq. (8-1) is said to be asymptotical- 
ly stable in the large if it is stable and if every solution converges to x, as t 
increases indefinitely. Obviously a necessary condition for asymptotic 
stability in the large is that there be only one equilibrium state in the whole 
state space. 

In control engineering problems, the desirable feature is asymptotic 
stability in the large. If the equilibrium state of a system is not asymptotically 
stable in the large, then the problem becomes that of determining the largest 
region of asymptotic stability. This is usually very difficult. For practical 
purposes, however, it is sufficient to determine a region of asymptotic 
stability large enough so that no disturbance will exceed it. 


Instability An equilibrium state is said to be unstable if it is 
neither stable nor asymptotically stable. Figure 8-3 shows an unstable 
equilibrium state x, of a second order system and a representative trajectory 
starting from x,. As may be seen from Fig. 8-3, in the case of an unstable 
equilibrium state, for some real number e > 0 and any real number 60, 
no matter how small, there is always in the circular region S(8) a state x, 
such that the motion starting from this state reaches the boundary circle 
H(e) of S(e). 

The foregoing definitions are the minimum requirement for understanding 
the stability analysis of linear and nonlinear systems presented in this chapter. 
Note that these definitions are not the only ones defining concepts of 
stability of an equilibrium state. In fact, various other ways to define concepts 


$ (^. xg, /9) (> 
(9 Sle) 


Hle) 


S (8) 


Figure 8-3 Unstable equilibrium state x, and representative trajectory starting from Xp. 
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of stability are available in the literature. In this chapter, however, our 
discussions on stability analysis will be based on the definitions just given. 


Outline of the chapter The following sections of this chapter 
present classical Routh-Hurwitz stability criteria applicable to linear time- 
invariant systems (Section 8-2); the first method of Liapunov (Section 8-3); 
fundamental theorems of the second method of Liapunov (Section 8-4); 
stability analysis of linear time-invariant systems by the second method of 
Liapunov (Section 8-5); a proof of Routh-Hurwitz stability criteria (Section 
8-5); stability analysis of nonlinear systems (Section 8-6); stability analysis 
of discrete-time systems (Section 8-7); a brief discussion of applications of the 
second method of Liapunov to system design (Section 8-8). Section 8-9 
presents some concluding comments. 


8-2 ROUTH'S AND HURWITZ STABILITY CRITERIA 


Stability analysis of linear time-invariant systems is important not only 
because many practical systems can be approximated by linear time-invariant 
models but also because local stability of an equilibrium state of a nonlinear 
system may be examined by stability of a linear time-invariant model which 
may be obtained by use of a linearization technique. 

Consider the system 


X = Ax 
where x is a state vector (n vector) and A is an n x n nonsingular martix. 
The equilibrium state is clearly the origin, x — 0. The nature of the solution 


of this equation is determined from the eigenvalues of A, or the roots of the 
characteristic equation 


[A — AI — 0 


If the coefficients in the characteristic equation are all real, then the roots of 
this equation are either real or complex conjugate. If the number of the dis- 

tinct roots is p, then the n components of the solution x can be written as 
x= Dale  G—L2....) 

j= 

where the X, are the distinct eigenvalues of A, and the g;;(t) are either con- 
stants or polynomials in ¢. If all of the eigenvalues of A have negative real 
parts, then the equilibrium state x = 0 of the system is asymptotically stable. 
Once we calculate the characteristic polynomial of A a variety of stabil- 
ity criteria can be applied to determine whether all the roots have negative 
real parts. Two commonly used criteria for this purpose are presented in this 
section. They are Routh’s stability criterion and Hurwitz stability criterion. 
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(Although these two stability criteria may seem different from each other they 
are actually equivalent and furnish the same information on stability.) 

In the following we first present Routh's stability criterion and then 
Hurwitz stability criterion. A proof of these stability criteria is given in 
Section 8-5. 


Routh’s stability criterion Consider the following polynomial 


with constant coefficients: 

AN” + AA" + aX"? + ++-+a,1r +a, =0 (8-3) 
where coefficients a’s are real quantities. [Often Eq. (8-3) is divided by ap, 
making the coefficient of the A” term unity.) 

If a polynomial with constant but undetermined coefficients is given, 
Routh’s stability criterion gives the condition for the coefficients of the 
polynomial to be satisfied in order that all the roots have negative real 
parts. [This is the condition for the equilibrium state of the system whose 
characteristic equation is given by Eq. (8-3) to be asymptotically stable. 
If the polynomial with known constant coefficients is given, Routh’s stability 
criterion determines how many roots of the polynomial have negative real 
parts, how many have positive real parts, and how many have zero real parts. 

The procedure for stability analysis by this criterion is as follows: 


1. If any coefficients a’s of Eq. (8-3) are zero or negative in the presence 
of at least one positive coefficient, there is at least one zero root, or at 
1east one root which has a positive real part or roots which are imagi- 
nary. (Therefore, in such a case the system is not asymptotically stable 
and if asymptotic stability is to be determined there is no need to fol- 
low the following procedure.) Note that the condition that all the 
coefficients a’s are positive is a necessary condition. This may be seen 
from the following: A polynomial in X having real coefficients can 
always be factored into linear and quadratic factors, such as (X + a) 
and (A? + bA + c) where a, b, and c are real. The linear factors yield 
the real roots and the quadratic factors yield the complex roots of the 
polynomial. The factor (A? + bà + c) yields roots having negative 
real parts only if b and c are both positive. In order for all roots to have 
negative real parts, constants a, b, c, etc., in all factors must be posi- 
tive. The product of any number of linear and quadratic factors con- 
taining only positive coefficients always yields a polynomial with 
positive coefficients. The condition that all the coefficients a's are 
positive is, however, not sufficient to insure stability. The necessary 
(but not sufficient) condition for stability is that the coefficients a's of 
Eq. (8-3) must all be present and must all have the positive sign. 
(If all the constants a’s are negative, they can be changed into positive 
by mutliplying —1 to both sides of the equation. Hence we need 
consider only the case of positive sign.) 
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2. If all the coefficients a's are positive, arrange the coefficients of the 
polynomial in rows and columns according to the following pattern: 

x Qo ds Gy dg «— first row 

ACCU wo Gg Gp y 


AT = b, b, b, b, 


-3 
A” €i €» C3 Cg 


< second row 


X d d 0 
A? ey eo 0 
X RO 
X gı 

T 
first column 


The constants b,, b», b;, etc., are evaluated according to the following 
schedule: E 
b = did» — Qa; 
a, 
b: = Qa; — dod; 
a, 
b,-— ads — sd; 
a, 


The evaluation for b’s is continued until the rest of the b’s are all equal 
to zero. The same schedule of cross multiplying the coefficients of the 
two previous rows is used for evaluating the coefficients c's, d's, e's 
etc. Namely, iih 


= bia; as a, by 


Ci bi 
= bia; — a,b; 
Co Ba 
C3 = ba; xm a, b, 
1 
— 6d — die 
fi z 
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and 
Si = €s 
(Evaluation of the coefficients is continued until the (n + 1)th row is 
completed. The Routh array of coefficients is triangular.) It is noted 
that in developing the array the entire row may be divided or multiplied 
by a positive number in order to simplify the numerical calculation 
without altering the results on stability. Routh’s stability criterion 
states that the number of roots (of the polynomial) with positive real 
parts is equal to the number of changes of sign of the coefficients 
in the first column. Note that the exact values of the terms in the first 
column need not be known, instead only signs of the terms are needed. 
The necessary and sufficient condition that all the roots of Eq. (8-3) 
have negative real parts is that all the coefficients a's of Eq. (8-3) are 
positive and all the terms in the first column of the array have positive 
signs. 
Example 8-1 If Routh's stability criterion is applied to the following 
third-degree polynomial: 
aM + air? axo a = 
where all the coefficients a's are positive numbers, then the Routh array of co- 
efficients becomes 


X Go az 

7 a a3 

A! Q1» — dod35 0 

ai 

A a3 
The condition that all the roots have negative real parts is given by aia; > aya3. 
As the degree of the polynomial increases, the conditions for asymptotic stability 
become more complicated. 

Example 8-2 Consider the following example: 
M + 243 + 3A? + 444+5=0 

Following the procedure just presented the Routh array of coefficients is con- 
structed. The first two rows can be obtained directly from the given polynomial. The 


remaining terms are obtained by the given schedule. (If any coefficients are missing, 
they are replaced by zeros in the array.) 


M 3 5 &B.rzss 

AS 2 4 0 As y » ue secon d — 
one sign p 1s 0 one sign w 15 — 
change "T change i ud d 
one sign one sign A 
change ‘A? 5 ! change *A? 5 


(————————1 


In this example the number of changes of sign of the coefficients in the first col- 
umn is two. This means that there are two roots with positive real parts. In this 
example it is demonstrated that the result is unchanged when the coefficients of 
any row are multiplied or divided by a positive number in order to simplify the 
evaluation of coefficients. ; 
3. If the first column term in any row is zero but the remaining terms 
are not zero, the zero term is replaced by a very small positive number 
e and the rest of the array is evaluated. 


Example 8-3 Consider the following equation: 
X^-2 -XA-2-0 (8-4) 
The Routh array of coefficients is 


A 1 1 
A? 

AP (se 0 
X? 2 


If the sign of the coefficient above the zero (e) is the same as that below it, it indi- 
cates that there are a pair of imaginary roots. Actually Eq. (8-4) has roots X = j 
and X = —j. If, however, the sign of the coefficient above the zero (e) is opposite 
that below it, it indicates that there is one sign change. For example, for the follow- 
ing equation: 


P= DI —X 4-22 EDA — 19 — 2) 
the array of coefficients is 


A? 1 m 


one sign change Lo E 2 
one sign change (v One 0 
An 2 


Thus, there are two sign changes of the coefficients in the first column, which agrees 
with the correct result indicated by the factored form of the polynomial equation. 


4. If all the coefficients in any derived row are zero, this indicates a 
pair or pairs of real roots with equal magnitudes but opposite signs 
and/or a pair or pairs of conjugate imaginary roots. In such cases, 
evaluation of the rest of the array can be continued by forming an 
auxiliary polynomial with coefficients of the last row and taking the 
coefficients of the derivative of this polynomial for the next row. Such 
roots with equal magnitudes but opposite signs can be found by 
solving the auxiliary polynomial which is always even. For a 2nth- 
degree auxiliary polynomial, there are n pairs of equal and opposite 
roots. 


" dr ees deir arr quic quoi. AAS REESE Est SUMI Piece esie MONUIT o TS PUN SUMI oT TES iin SoS ERES TO da 3 
H penaa " 


Example 8-4 Consider the following equation: 
AS + 2At + 2403 + 48A? — 25A — 50 = 0 
The Routh array of coefficients is 
A5 1 24 —25 
A! 2 48 —50 — auxiliary polynomial P(X) 
AP» 0 g 
The terms in the A? row are all zero. The auxiliary polynomial is then formed from 
the coefficients of the A* row. The auxiliary polynomial P(A) is given by 
P(A) = 204 + 48A? — 50 
which indicates that there are two pairs of roots of equal magnitudes and opposite 
signs. These two pairs of roots are obtained by solving the auxiliary.polynomial 
equation P(X) = 0. The derivative of P(A) with respect to X is obtained as 
APOS _ $33 4 96, 
a 8X3 + 9 
The terms in the A? row are replaced by the coefficients of the last equation, namely, 
8 and 96. The array of coefficients then becomes as follows: 


AS 1 24 —25 
At 2 48  —50 
AS 8 96 0 — coefficients of dP) 


dX 

A? 24 —50 0 

A! 112.7 0 

A» —50 

It is seen that there is one change in sign in the first column of the new array. Thus, 


the original equation has one root with a positive real part. By solving for roots 
of the auxiliary polynomial equation, 


2A + 484? — 50 = 0 


one sign change C 


we obtain 
A=+1 or ANS 


These two pairs of roots are part of the roots of the original equation. As a matter 
of fact the original equation can be written in a factored form as follows: 


(X + 1). — 1). + 5) — SA + 22 = 0 
Clearly, the original equation has one root with a positive real part. 
5. Finally it is noted that if the following substitution is made in Eq. 


(8-3) 
A= ypo 


where c is a constant and the polynomial is written in terms of u, then 
the application of Routh's stability criterion reveals the number of 
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roots which have real parts greater than —c. Namely the number of 
changes of sign in the first column of the Routh array developed for 
the polynomial in w is equal to the number of roots which have 
real parts greater than —o. , 


Hurwitz stability criterion Hurwitz stability criterion gives 
conditions for the coefficients of the polynomial given by Eq. (8-3) to be 
satisfied in order that all the roots have negative real parts. As stated in 
preceding discussions, in order that all the roots have negative real parts, 
all the coefficients a’s must be positive. This is a necessary condition but not 
a sufficient condition. If this condition is not satisfied it indicates that some of 
the roots have positive real parts or are imaginary or zero. The sufficient 
condition for all the roots to have negative real parts is given in the following: 
If all the coefficients of the polynomial are positive, arrange these coefficients 
in the following determinant: 


a @; dg «s+ O 0 0 
Ao Qn a j 
Qa, Q3; Qn 0 0 
A, = ao ay ü. O 0 
Gs ay Q 
An-3 an-ı O 
9 6 B die Gee VR 


where we substituted zero for a, if s > n, or alternatively, 


a, a 0 0 = . 0 
Q5 a, ay Qo s . 0 
a; Q4 ds ay s : 0 
A = 
0 Ge Le ce Caw des 
0 ZA 4A WT o 
0 0 0 0 ds 


For all the roots to have negative real parts, it is necessary and sufficient 
that successive principal minors of A, be positive. The successive principal 
minors are the following determinants: 


F 
| 
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a, d; Q»i-1 a a e. 0 
QA, Ge Q»i-2 a5 a: 
0 a ü»i-3 as a; 
Ai — = 
O: Ob s Qi Q»-i1i Gri-2 a; 


cI, 5. oon) 


where a, = 0 if s > n. (It is noted that some of the conditions for the lower- 
order determinants are included in the conditions for the higher-order 
determinants.) If all these determinants are positive, and a, > 0 as already 
assumed, the equilibrium state of the system whose characteristic equation 
is given by Eq. (8-3) is asymptotically stable. Note that exact values of deter- 
minants are not needed, instead only signs of these determinants are needed 


for the stability criterion. 


Example 8-5 For the following characteristic equation: 
aon! + an + aor? + aÀ +a = 0 


the conditions for asymptotic stability are that all the a's be positive and that 


Qa; a3 
Az = = did; — apa; > 0 
ag a2 
aq Q3 0 
A3 =| @ Q0» a 
0 a, Q3 


aı(aza; — a144) — aaj 


= alaia: — aoa) — aja, > 0 


It is clear that if all the a’s are positive and if the condition A; > 0 is satisfied, the 
condition A: > O is also satisfied. Therefore, for all the roots of the given character- 
istic equation to have negative real parts, it is necessary and sufficient that all the 
coefficients a’s are positive and A; > 0. 


Equivalence of Routh’s stability criterion and Hurwitz stability 
criterion The equivalence of Routh’s stability criterion and Hurwitz stability 
criterion can be easily established. If we write the Hurwitz determinants 


in a triangular form such that 


| 0° d; 
where the elements in the lower left of the main diagonal are zero and those 
in the upper right of the main diagonal are zero or nonzero, then, clearly 

A, = dud» +++ d (i= 1,2, ..., n) 


Using this fact the Hurwitz conditions 


A, > 0, A: > 0, Bg 0. ous Ne SO 
reduce to the conditions 
Qui 0, an > 0, a33 > 0, ea. Maa 2» 0 


It can be shown without difficulty that in terms of a;, b, c,, and so on, in the 
Routh array, a,,, dos, @33, . . . can be obtained as 


, — d, Qo = bi, 053 = Cy, 
Hence the Hurwitz conditions for asymptotic stability become 
a, > 0, b, > 0, €, > 0, 


which are the Routh conditions for asymptotic stability. (For details, see 
Prob. A-8-3) In this way the equivalence of Routh’s stability criterion 
and Hurwitz stability criterion can be established. 


8-3 FIRST METHOD OF LIAPUNOV (8-4) 


The first method of Liapunov consists of all procedures in which the explicit 
form of the solution is used for stability analysis. In this method each 
equilibrium state, if there is more than one equilibrium state, is investigated 
separately. j 

Consider the nonlinear system 


x = f(x) 1 (8-5) 
where x is a state vector (n vector), and f(x) is an n vector and is continuously 
differentiable in xi, x», .. . , x,. Let us expand the nonlinear vector function 
f(x) in Taylor series about the equilibrium state x, in question. Introducing 
a new vector, y= x — x, the equilibrium state can be shifted to the 
origin. By expanding f(x) in Taylor series about x — Xe, Eq. (8-5) becomes 


y = Ay + G(y 
where A is the n X n Jacobian matrix given by 


E AA s WA 
gxi t ÓX, 
af, Of ... Af 
A= Ox, OX» OX, 
of fe... 0f 
Ox, OX» OXn 


where fi, fo,...,f, are the n components of f(x). All the partial derivatives 
appearing in the Jacobian matrix A are evaluated at the equilibrium state 
X = X, or y = 0. The n x n matrix G(y) contains terms arising from the 
higher-order derivatives in the Taylor series expansion. The elements of G(y) 
vanish at the equilibrium state. Equation (8-5) can thus be linearized near the 
origin as follows: PaP ax 
Equation (8-6) is the first approximation to the nonlinear equation, Eq. 
(8-5). We say that Eq. (8-6) is obtained from Eq. (8-5) by linearization. 
Liapunov showed that if all the eigenvalues of the constant matrix A in 
Eq. (8-6) have nonzero real parts, then the stability of the equilibrium state 
X, of the original nonlinear equation, Eq. (8-5), is the same as that of the 
equilibrium state y — 0 of the linearized equation, Eq. (8-6). Hence if the 
eigenvalues of A have negative real parts, then the equilibrium state x, is 
asymptotically stable and all solutions of the system given by Eq. (8-5), 
with initial state x(0) sufficiently close to x., approach x, as z — co and if 
at least one of the eigenvalues of A has a positive real part, then the equi- 
librium state x, is unstable. If, however, at least one of the eigenvalues of A 
has a zero real part, the local stability behavior of the equilibrium state 
X = x, of the system, Eq. (8-5), cannot be determined by Eq. (8-6). We 
shall call this case the critical case. In the critical case the local stability 
behavior of the equilibrium state of the system depends on higher-order 
terms of the Taylor series expansion of f(x). For example, if one eigenvalue 
of the Jacobian matrix A is zero at the equilibrium state, then this state is 
asymptotically stable if in its neighborhood A has only eigenvalues with 
negative real parts. The equilibrium state is unstable, however, if in its 
neighborhood there is at least one eigenvalue with a positive real part. 

The conclusions by the stability analysis in the first approximation apply 
only in the neighborhood of the equilibrium state. Namely, the first method 
of Liapunov concerns only stability in the small ; that is, whether a motion 
originating from a neighborhood of an equilibrium state Xe will approach 
x, as t increases indefinitely. The method gives no indication of how small 
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the neighborhood must be in order that the motion approach x, as t — oo. 
Stability analysis in the large is more difficult than stability analysis in the 
small. The second method of Liapunov to be presented in section 8-4 answers 
some of the questions of stability analysis in the large as well as local stability 
analysis of linear and nonlinear systems. 


8-4 SECOND METHOD OF LIAPUNOV! 


The second method of Liapunov attempts to give information on the stability 
of equilibrium states of linear and nonlinear systems without any knowledge 
of their solations. The stability information obtained by this method is 
precise and involves no approximation. 

The second method of Liapunov is based on a generalization of the idea 
that if the system has an asymptotically stable equilibrium state, then the 
stored energy of the system displaced within the domain of attraction 
decays with increasing time until it finally assumes its minimum value at the 
equilibrium state. The second method of Liapunov consists of determination 
of a fictitious “energy” function called a Liapunov functicn. The idea of the 
Liapunov function is more general than that of energy and is more widely 
applicable. Liapunov functions are functions of xj, xs ..., X», and t. 
We denote Liapunov functions as V(x,, X, ... , Xn, t) or simply as V(x, t). 
If Liapunov functions do not include ¢ explicitly, then we denote them as 
V(X;, Xs ..., Xn) or V(x). Liapunov functions can sometimes be expressed 
as functions of not only Xi; Xa; xas: X,, f, but also Xj, X;, ... , X4, where the 
bar denotes the complex conjugate. In this book we consider only physical 
systems. In such systems complex elements of the state vector x must 
occur as complex conjugate pairs. Hence for such systems V(x,, x,,..., 
Xj Xj Ric ice Hay f) can.be expressed simply as F(X fi -e-s Xa, t) Or 
V(x, t), because if x; and x; are a complex conjugate pair then X; = x;, and 
if x, is real then X, = x,. In the second method of Liapunov the sign be- 
haviors of V(x,t) and its time derivative V(x,f) give information on 
stability, asymptotic stability or instability of the equilibrium state under 
considération without directly solving for the solution. (This applies to both 
linear and nonlinear systems.) 

In this section we first present Liapunov's main stability theorem and its 
proof and then present several modified forms of the stability theorem. We 
assume that the equilibrium state under consideration is at the origin of the 
state space. The essence of the second method of Liapunov is given in the 
following main theorem: 


This section follows closely (8-2, 8-3). Numbers in parentheses refer to titles in the 
references listed at the end of the chapter. 


| 
| 
| 
| 
| 
| 
| 
| 
| 
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Let the 


Theorem 8-1  (Liapunov's main stability theorem) 
system be defined by Eq. (8-1) which is rewritten thus: 
X = f(x,t) 
Suppose that 
f(0, 1) — 0 for all t 


Suppose also that there exists a scalar function Vix, t) which has continuous 
first partial derivatives. If V(x, t) satisfies the following conditions: 


1. V(x, t) is positive definite, namely, V(0,t) = 0 and 

2. V(x, t) > a (lxi) > 0 for all x Æ 0 and all t where œ is a continuous, 
nondecreasing scalar function such that a(0) = 0. . » 

3. The total derivative V is negative for all x #0 and all t, or V(x, t) < 
—y (||x\|) < 0 for all x # 0 and all ¢ where y is a continuous non- 
decreasing scalar function such that y(0) = 0. 

4. There exists a continuous, nondecreasing scala 
80) = 0 and, for all t, Vix,th< A(\\x!)) l 

5. a(\|x||) approaches infinity as ||x|| increases indefinitely, or 


r function such that 


a(xl) — o» — as ||x|| > ee 


then the origin of the system, x — 0, is uniformly asymptotically stable in 


the large. 


Proof To prove uniform asymptotic stability in the large we 
need to prove the following: 
1. The orig.n is uniformly stable. 
2. Every solution is uniformly bounded. P 
3. Every solution converges to the origin as t — co uniformly in f, an 
||xo|| < 8 where 6 is fixed but arbitrarily large; that is, given two rea 
numbers 8 > 0 and y > 0 there is a real number T(p, 8) such that 


[xol] < 8 
implies 
IOl: Xo to] << m forall t > to + Tlu, 9) 


i i i i ial equation. 
where (1; Xo, fo) is the solution to the given differentia 
Since £ is continuous and (0) = 0, we can take 8(e) > 0 such that 
B) < ae) for any e > 0. Figure 8-4 shows curves a(\|x|), Axi), and 


V(x, t). Noting that 
V(d(t; Xo, to), t) ic V(Xo, to) 


=f ' Y(&(r;xo t) )dr«0, h 
to 


Bx 


ale) -----------,<~---f--------------- 


B(8) r-----7 


[o] S(e) € lxil 
Figure 8-4 Curves a(|x]|), &(Ix]), and V(x, t). 


if ||Xo|| < 8, ty being aribtrary, we have 


a(e) > PÈ) > V(xo, to) > V (CE Xo, to), t) > alt; xo, to)ll) 
for all ¢ > to. Since & is nondecreasing and positive, this implies that 


llé(xs 8) «e for t> ta Ilx <ê 


Hence we have shown that for each real number e > 0, there is a real number 
8 > 0 such that ||x,|| < 8 implies ||(+; Xo, to)|| < e for all t > tẹ Thus we 
have proved uniform stability. 

Next, we shall prove that ||ó(t; xo, to)||—0 with t — co uniformly in 
t, and ||xo|| & 6. Let us take any 0 < u < ||x;|| and find a v(u) > 0 such 
that B(v) < ou). Let us denote by e'(u, 8) > 0 the minimum of the conti- 
nuous nondecreasing function y(Jx|) on the compact set »(4) < ||x||< 
e(8). Let us define 


MNT (u, ò) 
Suppose that ||(t; Xo, fo)|| > v over the time interval t, <t < t; = t, + T. 


Then, we have 


0 < av) < V(d(ti; Xo, to), ti) < V(X, to) — (tı — toe’ < 8(8) — Te’ = 0 


which is a contradiction. Hence for some 1 in the interval t, «tx h, say, 
t, we have 


|[x2]] = I(t; xo, to)|| = v 


Therefore, 
CU(|| P(t 5 x», tl) < VCE; Xas ta), 1)  V(x» te) < B(v) < a (2) 
for all t > t,. Hence, 
llt Xo, t9)l] < u 


for all. t > f, + T(u, 8) >t, which proves uniform asymptotic stability. 
Since a(|x| — ee as ||x]|] — co there exists for arbitrary large 6 a constant 
e(5) such that (8) < « (e). Moreover, since e(8) does not depend on fə, the 
solution (ft; Xo, to) is uniformly bounded. We thus have proved uniform 
asymptotic stability in the large. 


Liapunov function The Liapunov function V(x, 1), a scalar 
function, is a positive definite function, and it is continuous together with its 
first partial derivatives (with respect to its arguments) in the region Q about 
the origin and has a time derivative which, when taken along the trajectory, 
is negative definite (or semidefinite). Notice that V(x, 1) is actually the total 
derivative of V(x, t) with respect to t along a solution of the system. Hence 
V(x, t) < 0 implies that V(x, t) is a decreasing function of t. A Liapunov 
function is not unique in a given system. (For this reason, the second method 
of Liapunov is a more powerful tool than conventional energy consider- 
ations. Note that a system whose energy E decreases on the average, but not 
necessarily at each instant, is stable but E is not a Liapunov function.) 

The simplest positive definite function is of a quadratic form 


V(x) = 2; 2 quxix; (53 —1,2,...,8) 


In general, Liapunov functions may not be of a simple quadratic form. For 
any Liapunov function, the lowest-degree terms in V must be even. This can 
be seen as follows: If we define 


then in the neighborhood of the origin only the lowest degree terms become 
dominant and we can write V(x) as 


V(x) = xh V(z, Zase s e3 Zn- 1) 


If we keep the z, fixed, V(z;, Z», ... , Zn-1, 1) is a fixed quantity. For p odd, 
x? can assume both positive and negative values near the origin, which means 
that V(x) is not positive definite. Hence p must be even. 

The second method of Liapunov is simple to apply in principle, but in 
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practice it is not necessarily easy for many cases, because no general method 
for constructing Liapunov functions is available at present, although there is 
a great flexibility in choosing a Liapunov function. There is, however, no 
need to solve the differential equation. It should be noted that failure to find 
a suitable V function does not imply instability but merely means that the 
stability condition cannot be guaranteed. 


Example 8-6 (8-2) Consider the system described by 


X; = xs — axi(x? + x1) 
8-7 
Xs = —xi— ax«(xi $ xi) ( ) 
where a is a positive constant. Clearly the equilibrium state is the origin, or x = 0. 
If we define a scalar function V(x) of the state variables by 


Vix) = x? + x3 


which is positive except at the origin where it is zero, then the time derivative V(x) 
along a solution of Eq. (8-7) is 


V(x) = VO) — alt + xb? 


which is negative unless x; = x; = 0. This shows that V(x) is constantly decreasing 
along any solution of Eq. (8-7), and hence V(x) is a Liapunov function. Let V(x) 
take constant values, 0, Ci, Co, €,...,(0 OC, < C4 <C3-- J Then F= 0 
corresponds to the origin of the state plane. V = Cy, V = Cs,... describe non- 
intersecting circles enclosing the origin of the state plane as shown in Fig. 8-5. 


*s 


V Increase 


Figure 8-5 Constant V contours and representative trajectory. 


| 
| 
| 
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The circle V = Cy, lies completely inside of the circle V = C;,,. A representative 
trajectory crosses the boundary of V contours from outside toward inside. From 
this, the geometric interpretation of a Liapunov function may be stated as follows: 
V(x) is a measure of the distance of the state x from the origin of the state space. 
If the distance between the origin and the instantaneous state x(t) is continually 
decreasing as / increases (that is, V(x(r)) < 0), then x(t) > 0. Since a Liapunov 
function V(x) is regarded as defining distance from the origin in the state space, 
its derivative can be used to give a quantitative estimate of the speed with which 
the origin is approached. Note that in this example V(x) becomes infinite with 
infinite deviation from the equilibrium state. Hence the equilibrium state, the 
origin, of the system is asymptotically stable for all possible initial conditions, or 
asymptotically stable in the large. 


Several stability theorems which are based on Theorem 8-1 are now 
presented. The system differential equation considered in Theorems 8-2 to 
8-6 is defined by 

xx.) (8-8) 
where x is a state vector (n vector) and f(x, t) is an n vector having con- 
tinuous first partial derivatives in the region Q (Q can be the entire state 
space) which includes the origin. We assume that f(0, 1) = 0; namely, there 
is an equilibrium at the origin. We also assume that notation V(x, 1) » 0 
implies that Conditions 2 and 4 of Theorem 8-1 are satisfied. 


Theorem 8-2 If there exists a scalar function V(x, t), with 
continuous first partial derivatives, satisfying the following conditions: 


(a) V(x,t) > 9 for all x Æ 0 in Q and all t 


V(0, 1) 2 0 for all t 
(b) V(x,t) «0 for all x #0 in Q and allt 
V(0,1)=0  forall: 


then the origin of the system of Eq. (8-8) is uniformly asymptotically stable. 

If the foregoing conditions are satisfied in the whole state space and either 
V(x,t)— œ as ||x|| 5 œ, or V(x, t) < —m « 0 for all ||x|| > M and 
some m, M > 0, then the origin is uniformly asymptotically stable in the 
large. 

If asymptotic stability is assured only for all possible motions starting 
within Q, then practical considerations may determine whether the size of Q 
is sufficient. 

To prove stability (but not asymptotic stability) of the origin the fol- 
lowing theorem can be applied: 


Theorem 8-3 If there exists a scalar function V(x,1£), with 
continuous first partial derivatives, satisfying the following conditions: 


(a) V(x,t)>0 for all x 40 in Q and all t 
V(0, 13) =0 for all t 

(b) (x,t) 0 — forall x £0 in Q and all : 
V(0,t)=0 forall: 


then the origin of the system of Eq. (8-8) is uniformly stable. 

This theorem states that the existence of such a V(x, t) function implies 
the uniform stability of the origin. It should be noted that V(x,t) <0 
along the trajectories means that the origin is uniformly stable but not neces- 
sarily uniformly asymptotically stable. Hence, in this case the system may 
exhibit a limit cycle operation. 


Theorem 8-4 Suppose that there exists a scalar function 
V(x,t), with continuous first partial derivatives, satisfying the following 
conditions: 


(a) V(x,t)>0 for all x Æ 0 and all : 
V(0, 1) —0 for all t 
(b) Y(x,t) <0 — forall x z 0 and all z 
V0,t)=0 . forall: 
Let E be the set of all points where V(x, t) = 0, and let M be the maximum 
invariant set contained in £. (A set M is said to be invariant if each trajectory 
originating in M remains in M for all t.) Then every solution of Eq. (8-8) 
bounded for t > 0 approaches M as t — oo. If, in addition, V(x, t) — oo as 
||x|| — eo, then each solution is bounded for t 20, and all solutions 
approach M as t > co. 
We say that x(t) approaches a set M as t approaches infinity if for each 
e > 0 there is a T > O with the property that, for each t > T there is a state 
p in M with ||x(t) — p|| < e; that is, for all t > T the states x(r) are within 
a distance e of M. If the maximum invariant set contained in E is the origin, 
X — 0, then we may state this theorem as follows: , 


Theorem 8-5 If there exists a scalar function V(x, 1), with 
continuous first partial derivatives, satisfying the following conditions: 
(a) V(x,t)>0 for all x = 0 and allt 
V(0,1) — 0 for all t 
(b) Y(x,?) x0 — forall x z 0 and all t 
V(0,?) — O — forall: 
(c) VIOC; Xo, to), t) does not vanish identically in t > tẹ for any f, and 
any X, ~ 0, where ó(t; Xo, to) denotes the solution starting from Xo 
at f, 


then the origin of the system of Eq. (8-8) is uniformly asymptotically stable 
in the large. 

This may be seen as follows: If V(x, t) is not negative definite but only 
negative semidefinite, then the trajectory of the representative point can 
become tangent to some particular surface V(x, t)= C. Since V($(t; Xo, fo), t) 
does not vanish identically in t > t; for any 1, and any x, # 9, the repre- 
sentative point cannot remain at the tangent point [the point which cor- 
responds to V(x, t) = 0] and therefore must move toward the origin. 


Example 8-7 Consider the following system: 


i RN (8-9) 
Xe = —XxXe— xi 
Clearly the origin is the equilibrium state. Let us consider the following V(xi, x2): 
V(xi, x2) = xi + 2xi 
Then E 
V(Gxi, x2) = 4xixs + 4x(— xa — xi) 
= —4x; [<0 
Let E be the set of all points where V(x1, x2) = 0 and let M be the maximum in- 
variant set contained in E, 
E = (Qa, x2) 5 V(x, xs) = 0} 
= (0,0) ; — œ < xı < + oo) 
If 
x. =0 
then 
X= —xi = or x =0 
Hence the maximum invariant set M is the origin x; = 0, x» = 0. Since V(x) — co 
as ||x|| > co, each solution is bounded for £ > 0, and we can conclude that all 


solutions approach M, the origin in this case, as 7 — co. Hence the origin of the 
system of Eq. (8-9) is asymptotically stable in the large. 


Finally, to prove instability of the origin we have the following theorem: 


Theorem 8-6 If there exists a scalar function V(x,1), with 
continuous first partial derivatives, satisfying the following conditions: 
(a) V(x, t) 20 for all x #0 in Q and all ż 
V(0, 1) —- 0 for all t. 
(b) Y(x,:) 20 forall x #0 in Q and all ż 
VY(0,2)-—0 forall? 
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or 
(a) V(x, t) <0 for all x = 0 in Q and allt 
VO, £):—0 for all t 
(b) V(x,t) -0 . forallx z 0 in Q and allt 
V(0,:2) -O . forallt 


then the origin of the system of Eq. (8-8) is unstable. 

This theorem states that if both V(x, t) and V(x, t) are definite and have 
the same sign, then the origin is unstable and the trajectories near the origin 
diverge to a limiting value or infinity. 

In concluding this section, note that in applying these stability theorems 
to a given system the stability conditions obtained from a particular V 
function are sufficient but may not be necessary. Furthermore, a Liapunov 
function for a particular system is not unique. Therefore, one should note the 
following: 


1. Failure in finding a V function to show stability or asymptotic stability 
or instability of the equilibrium state under consideration can give no 
information on stability. PI es 

2. Although a particular V function may prove that the equilibrium 
state under consideration is stable or asymptotically stable in the 
region Q which includes this equilibrium state, it does not necessarily 
mean that the motions are unstable outside the region Q. 

3. For a stable or asymptotically stable equilibrium state, a V function 
with the required properties always exists. 


8-5 STABILITY ANALYSIS OF LINEAR SYSTEMS 


In this section we first present applications of the second method of Liapunov 
to stability analysis of linear time-invariant systems. Then we give a proof 
of Routh-Hurwitz stability criteria. Finally, we present discussions of linear 
forced dynamic systems. 


Liapunov stability analysis of linear time-invariant systems 
Consider the following system: 
x= Ax . (8-10) 
where x is a state vector ( vector) and A is an n x n constant matrix. We 
assume henceforth that A is nonsingular; that is, its determinant [A] is 
nonzero. The stability of the equilibrium state of the linear time-invariant 
system can be investigated easily by use of the second method of Liapunov. 
The following is a basic theorem of stability analysis of linear time- 
invariant systems by means of the second method of Liapunov. 


| 
| 
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Theorem 8-7 The equilibrium state x = 0 of the system given 
by Eq. (8-10) is asymptotically stable if and only if given any positive definite 
Hermitian matrix Q (or positive definite real symmetric matrix Q), there 
exists a positive definite Hermitian matrix P (or positive definite real sym- 
metric matrix P) such that 

A*P + PA — —Q 
The scalar function x*Px is a Liapunov function for the system of Eq. 
(8-10). 


Proof We shall first prove that if there exists a positive 
definite Hermitian matrix P such that A*P + PA = —Q, then the equi- 
librium state of the system is asymptotically stable. Define 

V(x) = x*Px > 0 for all x #0 
as a tentative Liapunov function for the system of Eq. (8-10). Then V along 
any trajectory is obtained as 
V(x) = x*Px + x*Px 
= x*A*Px + x*PAx 
x*(A*P + PA)x 
= —x*Qx 


Thus 
V(xX) «0 foralx#0 
Hence the assumed V(x) is a Liapunov function and the origin of the system 


is asymptotically stable. l 
We shall next prove that if the equilibrium state x = 0 of the system is 
asymptotically stable, then there exists a positive definite Hermitian matrix 


P such that 
A*P + PA = —Q 
Referring to Prob. A-6-17, the matrix differential equation 


X= A*X + XA,  X(0-Q (8-11) 


. has the solution 


X = e^ Qe“ 
Integrating both sides of Eq. (8-11) from t = 0 to t = co, 


X() — X(0) = A*(f x dt) + (fx di)A 
and noting that A is a stable matrix and, therefore, X(co) = 0 we obtain 


-Q- A'(f xa) + (fr xa)& 


Let us put 


P =f X dt = | e*Qe^ dt 
0 40 x 
Note that the elements of e^' are finite sums of terms like et, teh, 
eoo, 07 eM where the A, are the eigenvalues of A and m, is the multi- 
plicity of X,. Since the X, possess negative real parts, 


s e*t Qe! dt 
0 
exists. Note that 
pte a et 069 d= P 
0 


Thus, P is Hermitian (or symmetric if P is a real matrix). We have thus shown 
that for a stable A and for a positive definite Hermitian matrix Q there 
exists a Hermitian matrix P such that A*P + PA — —Q. We now need to 
prove that P is positive definite. Consider the following Hermitian form: 


Prs f e*' Qe“ dt x 
0 


= { (ex) Q(e'x) dr > 0 for x #0 
—0 forx=0 


Hence P is positive definite. This completes the proof. 

In applying this theorem for stability analysis of linear time-invariant 
systems, we compute a Hermitian matrix P (or real symmetric matrix P) by 
equating the matrices 


A*P + PA and —Q 


element by element. This results in n(n + 1)/2 linear equations for deter- 
mination of the elements p,, = p;, of P. These simultaneous equations are 
soluble unless an eigenvalue of A or the sum of a pair of eigenvalues of A 
is zero. In other words if we define the eigenvalues of A as M, 35,35, . ..,each 
repeated as often as its multiplicity as root of the characteristic equation, and 
if for every sum of the two roots, X; + A, =Æ 0, then the matrix P is uniquely 
determined by the matrix Q. Note that if the matrix A is a stable matrix then 
the sums X; + X, are always negative. By applying Sylvester's criterion for 
X*Px to be positive definite; that is, 


dh m Du pi pua 
pu > 0, > 0; Pü Pa pal, 
on Ee Pu P32 Pa 
(Dis = Pj) 


we can determine whether the origin of the system given by Eq. (8-10) is 
asymptotically stable. These conditions are clearly alternatives to the Routh 
or Hurwitz stability conditions. 

Asymptotic stability can always be examined for a linear time-invariant 
system by solving a set of n(m 4- 1)/2 simultaneous algebraic equations for 
the determination of a Liapunov function. (In the case of linear time-varying 
systems or nonlinear systems, however, no straightforward methods are 
presently available for finding Liapunov functions.) 

One alternate form of Theorem 8-7 follows: 


Theorem 8-8 A necessary and sufficient condition for x = 0 
to be an asymptotically stable solution of Eq. (8-10) is that there exist a 
positive definite Hermitian matrix P (or positive definite real symmetric 
matrix P) satisfying the equation 
-A*P + PA = —I 
where I is the identity matrix. 


Example 8-8 Determine the stability of the equilibrium state of 
the following system: 
Xi mm ctc 2x» 
Xo = Xx% 4) 2 
The equilibrium state is the origin or x = 0. Let us first solve the following 
equation: 
A'P + PA = —I 
or Saai 


» | ia m x Hs d E a EN z 4 (8-12) 
—2 —4j|p pz Piz poe 1 —4 0 —1 
where we have used the fact that po, = p,.. If the matrix P turns out to be positive 


definite, then x’Px is a Liapunov function and the origin is asymptotically stable. 
Equation (8-12) yields the following three equations: 


—2pu + 2p: = —1 
—2pu — 5pis + pos = 0 
—4pis — pz = —1 


Solving for p’s, we obtain 


Pa = za, Pu = -% Dee 7 E 
Hence 
23 7 
60 60 
P= 
7 1] 
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By Sylvester's criterion, this matrix is positive definite. Hence we conclude that the 


origin of the system is asymptotically stable. A Liapunov function is 


23 E" x1 
, a 1 (23x? — 14 11x? 
Vx) = x’Px = [a xs] 4 u = o xi — 14xixs + 11x) 
60 60 | | *2 
and Vis given by 
V= —x?— x} 


Notice that in the asymptotically stable linear time-invariant system such as 
this, if Q is given to be a positive definite Hermitian matrix, then there exists a 
positive definite Hermitian matrix P. For example, if in this system Q is given as 


1 =l] +j 
a AN E 
—L-—J E 
then the matrix P that satisfies A*P + PA = —Q is obtained as 
3 1 m 
jó ^. Cx. 
P= e 
Se 3 
S Ss 5 


P is clearly positive definite. 


We shall next demonstrate that the following vector matrix differential 
equation: (8-5) 


x= Ax (8-13) 
where x is an n vector and A isan n X n constant matrix given by 
0 I 0 0 -.- 0 0 0 
— br 0 10. 0 0 0 
0 =r 0 TF =: 0 0 0 
A= E p as ‘ z ‘ 
0 0 0 0 spe = 0 1 
0 0 0 0 "ee n 0 —b, —b, 
is equivalent to the following scalar differential equation: 
ax diy Te 
dr + 4g +.--+ta,x=0 (8-14) 
where a;, . . . , a, are real quantities, provided that a’s and b’s are related by 
A A Ai-34 
b —A, ba b=-5%, ..., =e = (8-15 
Sis M e" Av: V 


az a2 a, 1 see 0 


A, =| @s G( m d; a 0 (re 12,2) 


Q»i-1i Gri-2  * * m. Oy 
(a, is replaced by zero if s > n) 


and A, As ..., A. are nonzero quantities, and provided further that 
x — xy. As an example, we shall demonstrate that the following equation: 


X — Ax (8-16) 


where 


is equivalent to 


2 
a + a, DI aL GE pax = 0 (8-17) 


where a,, a», and a;, which are real quantities, are related to b,, ba, and b, by 
Eq. (8-15). Since 
A 0 1 OT1[rx Xo 
X;,|—|—b 0 1 x| =| —b3x1 + Xi 
Xs 0 —b, —bhiix; —bixi — b: xX 
we obtain 
X, + bığı + (b: + bj) + bib:xı = 0 


Since b’s are 


As -— A; 
b, S A, b; ae A; b; d A, A; 
where 
a, 1 
A, =a, A, = = 4,05 — a; 
a; a, 
a l '0 
A; —|a,; à; Q, |= 0,0505 — aj 
0 0 a 
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where A;, As, ..., An are Hurwitz determinants 
a, 1 0 0 >- 0 
t 
| 
| 


or 


Q0; — a 
D; = à, þm SEE 


a a 


where we assumed A,A, = a, (a, à; — a) #0, by substitution of these 
values of b’s into the preceding third order scalar differential equation, we 
obtain 


Xy + aX + aX, + asx, = 0 


From this we see that if a, Æ 0, a, a; = a, then Eq. (8-16) can be modified 
into Eq. (8-17), and vice versa. 

The following theorem essentially proves Hurwitz stability criterion. 
(Since Hurwitz stability criterion and Routh’s stability criterion are equi- 
valent, the following theorem also proves Routh’s stability criterion.) 


Theorem 8-9 (8-2) Consider the linear time-invariant system 


x = Ax (8-18) 
where 
0 I 0 0 0 
—b, 0 1 0 0 
0 —b,4 0 1. e 0 0 0 
*- o - $ 3 : : x (a 
0 0 0 0 ... —h 0 l 
0 0 0 0 ss 0 —b —b, 
and b,, b», . . . , b, are real quantities. The origin of the system is asymptotic- 


ally stable if and only if 
b, = 0, 


(A is called Schwarz matrix.) 


Dic» 0, 2:2 5.20 


Proof Let us consider the scalar function V — x' Px as a 
possible Liapunov function, where 


bibi --- b, 0 om m 0 0 0 
0 bibs- -bni 0 0 0 
P= : 
0 0 b, bb, 0 0 
0 0 0 biba 0 


o 
o 
o 
o 

= 


| 


By a simple calculation, we obtain 


V = x'(A'P + PA)x = —2b x} 


which is negative semidefinite. By examining Eq. (8-18) we find V cannot be 
identically equal to zero unless x, = 0 (i = 1, 2, . . . , n). Hence the necessary 
and sufficient condition for the asymptotic stability of the origin is that 
x’ Px be positive definite. This means that all the elements in the main - 
diagonal of P must be positive; hence b, > 0 (i = 1, 2,..., n). This com- 
pletes the proof. 

If we choose b's according to Eq. (8-15), then Eq. (8-18) is equivalent to 
Eq. (8-14). For asymptotic stability of the origin of the system of Eq. 
(8-18) or Eq. (8-14), Theorem 8-9 states that b’s must be positive which, 
by Eq. (8-15), is equivalent to saying that A, (i= 1, 2, ..., n) must be 
positive. This proves Hurwitz stability criterion. 

It is noted that we can obtain the correlation between the Routh array of 
the following equation: : 


(—1)"|A — Al| = 0 
where A is given by Eq. (8-19) and the coefficients of powers of X in succes- 
sive principal minors of the determinant 
[A —AI| 
starting from the top left-hand corner. By comparing the recurrence rela- 
tionship involved between principal minors of orders i, i — i, and ji — 2 


and the recurrence relationship defining the Routh array, we can prove the 
following genera! result which is stated as a theorem: 


Theorem 8-10 (8-5) The first column of the Routh array of 
(—1)"|A —A1| =0 
where A is given by Eq. (8-19) consists of the elements 
l, by, be, bib bib, b,b3bs, bbib, 5, b3b5b;, 


the final (n + 1)th element ending in 5,..! 

We shall next present a theorem due to H. R. Schwarz. The proof is 
left to the reader. (Note that by use of Theorems 8-9, 8-10, and 8-11 we 
can immediately prove Routh's stability criterion.) 


Theorem 8-11 (8-2) Ifa real constant matrix C is similar to 


the following matrix A 


1See Prob. A-8-4. 
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0 I 0 0 0 0 
—b, 0 1 
0 —b,_, 0 0 0 
A= « E 
0 0 0.0 -.- 0 1 


0 0 Q0 0 sss —b, =), 


that is C = T^! AT (T: nonsingular), then the number of eigenvalues of 
C which have negative real parts is equal to the number of positive terms in 
the sequence b,, bı bz, . . . , bib: +++ bn, provided that none of the b, is zero. 


Example 8-9 Consider the following matrix C: 


—6 -1 —5S 1 —1 Q9[|O 1. OEE O í 
C= 0 3 1|2/0 OF SE NG 9: 055] Oh Og 
6 1 6 0 I OJO 1 3][0 1 0 
= TAT 
C is clearly similar to A where 

010 

A=|6 0 1 

Of 3 


(Note that C and A have the same eigenvalues.) Since 
bi = —3, b: = —1, b; = —6 
the sequence 


bi = —3 
bibo =3 
bibyby = —18 


has one positive term. Hence the number of eigenvalues of C (or A) which have 
negative real parts is one. In fact, the characteristic equation of C (or A) is given by 


A — 3A? — 7. + 18 = 0 
which has three roots 
Ai = 2, Az = 3.541, As = —2.541 


Hence we can confirm that only one eigenvalue has a negative real part. 


Linear forced dynamic systems Finally we shall present a 
theorem on the linear forced dynamic system. 


Theorem 8-12 Consider the following system: 
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x = A(t)x + B(t)u 
where x =n vector (state vector) 
u=r vector (control vector or forcing vector) 
A(t) —n*x n matrix 
BH =n Xr matrix 
If the fundamental matrix (z, tọ) of 
Dt, to) = A(t) P(t, to), Pl to, to) = I 


is bounded and also 
t 
lim | |I, to) 7 (t, t) BC) d 


is bounded, then all solutions x for bounded u are bounded. 


Proof The solution x(t) can be written as 
x(t) = D(t, to) x(t) + Xt, to) | : &- (t, t)B(t,)u(t,) dts 
it follows that 
|| x(t)|] < IEC, to) ||xCt0)|] 
+ J) I, 1:9 7(, s) BCA) [UCI dr 
< |I, to)|| | xCo) I 
+ lim f^ jer, 0)-%(6, BEI luto d 


Since u(t), ||d*(t, to)||, and lim ING to)! (ti, t) B(t,)|| dt; are bounded, 
there exist constants c;, C+, and c, such that 

laH] < cı 

IIC, to)|| < es 


t 
lim f llt, 0) Bs i) B(ts)]| d < es 


Then, we obtain 
||x(2)|| < esllxCto)ll + eres 
Hence we see that x(t) is bounded. 
Note that asymptotic stability of the equilibrium state does not neces- 
sarily imply the bounded output for a bounded input. For example, consider 
the following system 


1 
dose re X(fo) = Xo, Lb>—l 


which has the solution 


wm l5 
x(t) iF? Xo 
The origin of this system is clearly asymptotically stable. Now consider the 
following equation: 


r= xu, x(0) = x, (8-20) 


MIT 


where u is a constant input. ii Mis (8-20) has the solution 
x(t) = Tri" -+ Piue, 2) 


If u(t) = c, = nonzero constant, Mit x(t) — (¢,/2)t as t — oo, and x(t) 
of Eq. (8-20) is not bounded. 

Similarly, the bounded output for a bounded input does not necessarily 
imply asymptotic stability of the equilibrium state, x — 0, when the input is 
removed. For example, consider the following system: 

X 5x, x(0) = x, (8-21) 
which has the solution 
x(t) = e° Xp 


Clearly the origin of Eq. (8-21) is unstable. Since, however, for a constant 
forcing function u the following equation 


x-—x-cu (8-22) 
has the solution 
x(t) = e' x + u (e — 1) (t 2 0) 
if we choose u(t) such that u(t) = —x, = c,, then x(t) — —c,. The output 


x(t) of Eq. (8-22) is thus bounded for a bounded input w(t), but the 
equilibrium state of the free dynamic system is unstable. 


8-6 STABILITY ANALYSIS OF NONLINEAR SYSTEMS 


In a linear free dynamic system if the equilibrium state is locally asympto- 
tically stable, then it is asymptotically stable in the large. In a nonlinear free 
dynamic system, however, an equilibrium state can be locally asymptotically 
stable without being asymptotically stable in the large. Hence implications of 
asymptotic stability of equilibrium states of linear systems and those of 
nonlinear systems are quite different. 

If we are to examine asymptotic stability of equilibrium states of non- 
linear systems, stability analysis of linearized models of nonlinear systems is 


——— M — er ep ie 


completely inadequate. We must investigate nonlinear systems without 
linearization. Several methods based on the second method of Liapunov are 
available for this purpose (8-6). They include Krasovskii’s method for 
testing sufficient conditions for asymptotic stability of nonlinear systems, 
Schultz-Gibson's variable gradient method for generating Liapunov functions 
for nonlinear systems, Lur'e's method (8-7, 8-8, 8-9) applicable to stability 
analysis of certain nonlinear control systems, Zubov's method (8-6) for 
constructing domain of attraction, and others. 

In this section we limit our discussions to Krasovskii's method, Schultz- 
Gibson's variable gradient method, and Lur'e's method. 


Krasovskii's method Consider the nonlinear system 
x = f(x) (8-23) 
where X = n vector (state vector) 


f(x) = n vector whose elements are nonlinear 
fünctionsS Of X X3; ss s s KS 


The Jacobian matrix for the system is 


Wi M... 9] 
Ox; X: ax, 
Of, ofr af 
= Afi, - --. fa) -— ax, ET eee a : 
F(x) P m E i ^ » 
af. af, RT of, 
x, ax, ax, | 


In this nonlinear system there may be more than one equilibrium state. 
It is, however, possible to transfer the equilibrium state under consideration 
to the origin of the state space by an appropriate transformation of coor- 
dinates. We shall, therefore, consider the equilibrium state under consider- 
ation to be at the origin. 

We shall now present Krasovskii’s theorem. 


Theorem 8-13 (Krasovskii's theorem) Consider the system 
described by Eq. (8-23). Assume that f(0) = 0 and that f(x) is differentiable 
with respect to x;, i = 1, 2,. . . , n. Define 

f(x) = F*(x) + F(x) 
where F(x) is the Jacobian matrix and F*(x) is the conjugate transpose of 
F(x). [If f(x) is real, then F(x) is real and F*(x) can be written as F'(x). F(x) 
is clearly Hermitian (symmetric if F(x) is real).] If F(x) is negative definite, 
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then the equilibrium state x — 0 is asymptotically stable. A Liapunov 
function for this system is 


V(x) = £*(x)£(x) 


If, in addition f*(x)f(x) — œ as ||x|| — co, then the equilibrium state is 
asymptotically stable in the large. 


Proof Since f(x) is negative definite, the determinant of F 
is nonzero everywhere except at x = 0. (See Prob. A-8-13.) There is no 
other equilibrium state than x = 0 in the entire state space, or f(x) z& 0 for 
X Æ 0. Since f(0) = 0, f(x) #0 for x #0 and V(x) = f*(x)f(x), V(x) is 
positive definite. 

Noting that 

f(x) = F(x)x = F(x)f(x) 
V can be obtained as follows: 
V(x) = £*(x)f(x) + f*G)f(x) 

= [F(x)f(x)]*f(x) + £*(x) F(x) f(x) 

= £*(x)[F*(x) + FG9]f(x) 

= f*(x) F(x)f(x) 
Since F(x) is negative definite it follows that V(x) is negative definite. Hence 
V(x) is a Liapunov function. Therefore, the origin is asymptotically stable. 
If V(x) = f*(x) f(x) tends to infinity as ||x|| —> co, then by Theorem 8-2 the 
equilibrium state is asymptotically stable in the large. 

Notice that Krasovskii’s theorem differs from usual linearization 
approaches. It is not limited to small departures from the equilibrium state. 
Notice also that V(x) and V(x) are represented in terms of f(x) or x rather 
than in terms of x. -3 

The foregoing theorem for asymptotic stability in the large gives sufficient 
conditions for nonlinear systems and necessary and sufficient conditions for 
linear systems. An equilibrium state of a nonlinear system may be stable 
even if the conditions specified in this theorem are not satisfied. Hence in 
using Theorem 8-13 be careful not to give erroneous conclusions about the 
stability of a given nonlinear system. 


Example 8-10 (8-2) Consider the second-order system with two 
nonlinearities: 


Xi = fii) + f(x) 
Xo = xı + axe 


We assume that fi(0) = f;(0) = 0, and that fi(xi) and f2(xz) are real and differen- 
tiable. In addition, assume that 


Lala) + f(x)? + [xi + ax]? — oo as ||x|| > co 


(8-24) 
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Determine sufficient conditions for asymptotic stability of the equilibrium state 


x — 0. 
In this system F(x) is given by 


F(x) = EM jesi 
where fila) = e and — fí(x)- oh : 


Then F(x) becomes as follows: 
F(x) = F/(x) + F(x) 
H | 2fi(m) 1+ ital 
1 + fí(xo) 2a 

By Krasovskii's theorem if FG) is negative definite, then the equilibrium state 
x — 0 of the system described by Eq. (8-24) is asymptotically stable in the large. 
Hence if 

(1) fia) «0  forallxiz50 

(2) 4afíQa) — [1 + fi(x:) 7> 0 for all xı #0, x» #0 
then the equilibrium state x = 0 is asymptotically stable in the large. (These two 
conditions are sufficient conditions for asymptotic stability. It is evident that 


because stability conditions are completely independent of actual forms of the 
nonlinear functions fi(x) and f(x), these conditions are unduly restrictive.) 


Variable gradient method for’ generating Liapunov functions 
(8-10) For an asymptotically stable system, there are a large or even in- 
finite number of V functions. In fact, it is the existence of this large number of 
suitable Liapunov functions that gives the second method of Liapunov an 
advantage over other methods in stability analysis. 

The major difficulty in applying the second method of Liapunov to the 
stability analysis of nonlinear systems is the lack of a straightforward 
procedure for finding Liapunov functions. Krasovskii’s approach assumes 
V to be a Hermitian form or a quadratic form in f(x). Such assumption of 
a Hermitian or quadratic form is unnecessarily restrictive simply because 
a Hermitian or quadratic V function may not exist for a given system. 

One useful systematic approach is the variable gradient method of Schultz 
and Gibson. This method is based on the fact that if a particular Liapunov 
function exists which is capable of proving asymptotic stability of a given 
system, then a unique gradient of this Liapunov function also exists. Through 
the introduction of a variable gradient and auxiliary equations, this approach 
to some extent reduces the emphasis on the ingenuity and experience of the 
investigator. 

We shall first present basic theoretical considerations and then explain and 
illustrate the method by examples. 


Consider the system described by the following equation: 


x = f(x, t) 
where x is a state vector (n vector) and f(x, t) is an n vector, elements of which 
are functions of xi, Xə, .. . , x, and t. We assume the equilibrium state under 


consideration to be the origin of the state space. Let us denote a tentative 
Liapunov function as V. In this tentative Liapunov function we assume, for 


convenience, that V is an explicit function of x,, xs, ..., X» but not an 
explicit function of t. Then 
NE OV S. oV 
dii ER LA + 9, Fe 
V can be determined from VV, the gradient of V, as follows: 
Y —(VVyx 
where (V V)' is the transpose of VV. VV is given by 
eV vu. 
Ox, 
VV = ? —] 
aV 
0X VE, 


V is obtained as line integral of VV as 
y - f "(VVy'dx (8-25) 
0 


The upper limit here is not meant to imply that V is a vector quantity, but 
rather that the integral is a line integral to an arbitrary point in the state 
space located at (xi, x4, . . . , Xn). This integral can be made independent of 
the path of integration. The simplest path of integration is indicated by the 
expanded form of Eq. (8-25) to be 


Zi, (Le=Ty=+++=z,=0) 
V == f VV;dx, 
0 


9) 
V V. dx» 


D (CTLILTMLTLIPJLIEET LO 
0 


Ln, (2, = Ly, T2=T2,..., Tnet = Zn-1) 
J VV, dx, (8-26) 
where V V, is the component of VV in the x, direction. 
For a scalar function V to be obtained uniquely from a line integral of 
a vector function VV, the following matrix F formed by 2VV;/0x;: 


avy; oVV,; Ta 2VV, 
Ox, OX» OXn 
. | OV, VV: |... OVV. 
p=| on Ox; OXn 
aVV, OVV, |. OVV, 
Ox, Ox, OXn 


must be symmetric. In the case of n = 3 we have 


9VV,  0VV, VV, OVV, VV, | OVV, 


8x ^ Ox" Ox Ox, ? Oxs- O: 


The condition on the matrix F is thus a generalized curl requirement for the 
n-dimensional case. The problem of determining a V function which satisfies 
Liapunov's theorem is then transformed into the problem of finding a VV 
such that the n-dimensional curl of VV equals zero. Further, the V and V 
determined from V V must be sufficient to prove stability; that is, they must 
satisfy Liapunov's theorem. We first set VV is equal to an arbitrary column 
vector 


Qj Xi F dis Xo e+ + Qin X2 


Qoi Xy + dos Xo e+ + os Xn 


Vy = (8-27) 


-O;uxXi daasxicbese Ann Xn 


The a, are completely undetermined quantities. The a,; may be constant 
or functions of time ¢ and/or functions of state variables. It is convenient, 
however, to choose a,, as a constant or as a function of time r. Several of the 
4,, may be chosen to be zero or they are obvious from constraints on V 
imposed by the investigator, or they are determined from the curl equations. 

If the equilibrium state x — 0 of a nonlinear system is asymptotically 
stable, then we may be able to obtain a Liapunov function by the method just 
described by following the following procedures: 


1. Assume VV of the form of Eq. (8-27). 

2. From VV, determine V. 

3. Constrain V to be negative definite or at least negative semidefinite. 

4. Use the n(n — 1)/2 curl equations implied by the statement that F 
must be symmetric to determine the remaining unknown coefficients 


in VV. 
5. Recheck V, as the addition of terms required as a result of step 4 may 
alter V. 
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6. Determine V by Eq. (8-26). 

7. Examine the region of asymptotic stability. 

Note that failure to obtain a suitable Liapunov function by this method 
does not imply instability of the equilibrium state. (In fact no conclusions 
about stability or instability can be made.) Two examples are given to il- 
lustrate the preceding procedure. 


Example 8-11 Consider the following system: 


s 6xi 
EUM NUT. 
A (1 + xi)? i 
te HOw e 
(1 + xi) 


Determine the stability of the equilibrium state x — 0 of the system. 
Let us attempt to find a suitable Liapunov function by the variable gradient 
method. Let 


vv- ves + wi 
2X1 + 2X2 
Then V can be obtained as D 
; , 6aii xi 
y -(VVyx-- $20xi. 
(VVYx a+ xp + 2ai2x3 


6a 
+ (2an = ots) XiX: — 2anxi 


c - (4+ 2au ) xix 
+ x? G+ xyes? 
Here, there are a number of ways in which V might be constrained in order to prove 
stability. For the purpose of proving stability, Ý must be at least negative semi- 
definite. This can be accomplished by choosing, for example, ai: = a»; = 0 and 
41 = 2. With these values of a@;;, a12, and azı we obtain 


oe; SO 4x3 
aF a+ xi 


from which we sce that V is negative definite. The function V is then obtained from 
Zi, (Z2=0) Zs, (riri) 
V= r 2xidx; + E 2x2dx2 = xi + xi 
We see that V is positive definite. Hence V is a Liapunov function. Since V — oo 
as ||X|| —> co the equilibrium state x = 0 is asymptotically stable in the large. 


Example 8-12(8-11) Examine the stability of the equilibrium state 
x = 0 of the following system: 


Xi = X2 
Xo = Xa 


Xi = — (xı + bx2)® — cx; 


| 
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Let us attempt to construct a Liapunov function by the variable gradient 
method. Let the gradient of V be defined by 
QuXi + di2Xo + 013 X3 
VV —| anxı + anx: + 23X3 
Q31X1 + d32X2 + X3 
then the derivative of V becomes 
y -(VVYx 
= (an Xi + dioXo + dis X3) X2 
+ (an xı + doo Xo + 23X3) X3 
— (azıXı + az2X2 + X3)cXs 
— (as1Xi + dao Xo + Xi)(1 + bx) 


V might be constrained in a number of ways in order to prove stability. By 
looking at the equation for V we first choose 


423 = C, aa = c*, azı = 0, a3;=0 


Then VV is simplified as 


432 — C, 


ai Xi + di2X2 
VV-| anxı + x: + cxs 
CX» + X3 
The values of the a’s must be such that the following curl equations are satisfied: 


VV, 0VW. — 0VW OVV AV V2 OVV, 


QXa Ox, i Qxs ui Ox, 2 Óxs Oe (5-98) 
or 

€(aixi) ES O(ai» xa) 23 O(a»1 xı) 
Oxe xz xi 

(ai xi) O(ais x2) * 
Ox3 + Ox; x9 d "i 

Agnes) +e=c 
3 


From Eq. (8-29) it is clear that a», does not involve x3. As a trial, choose 
a = = (x1 + bx») 
Then 
lanx) _ 2 
E ^u = 3(x1 + bx») 
If we choose 


is = = (x1 + bx) 


e a i V E e e MM Hm TQ ———— e —— 


then, 


Ceu = 3G bx) 


If an is a function only of xı, then such choices of a»; and aj» would satisfy Eq. 
(8-29). Hence we choose VV as follows: 
auQxi)xi + + (xı + bx)? 


VVv= (8-30) 


(xi + bx) + cà xs cx 
CX» + Xs 


then the derivative of V becomes 
y —(VVyx 

= [ a Gs + E (xı + bx |x. 

b - = 

+ [Qa + bx)? + èx exs]xs 


+ (exa + x) [— (xı + bx) — cx] 
= (b? — cb*)x$ + 3(1 — be)xi x 

+ 3(b — b?c)xyx3 + (+ E c)xaxi 

+ oau(xi)xixe 
If we choose 

an(xı) = G — ip 
then 
V = x$(1 — bob xà + 3x7 + 3bx1 x2) 

Note that the quantity (3x1 + 3bxixs + b?x$) is positive definite, since 


3  X5b 


3>0, f 
1.55 b 


Hence for the negative definite condition on Ý, we have 


bc > 1 (8-31) 
VV defined by Eq. (8-30) can now be written as 
ee E E jy 
(c Lys + T (xı + bx) 
vw (x1 + bx) + c^ xs + cxi 
CXo + X3 


V is found by the following line integration 
y= É (VV) dx 


Ey, (z:9 2,20) Z2, (zi 4, 2,0) 
= | ext dx, + E $ UU [Qa + bx)? + cx] dxs 

Zy, (Ly =X, 2211) 
[ (cx: + X3)dxs 


0 


= 4$ + z (xı T bx) + lx. Tox 


Clearly, V is positive definite if 
b90, os 0 


Note that V(x) — oo as ||x|| — co. 

Since from inequality (8-31), V is negative definite if bc > 1, we conclude that 
Vis a Liapunov function under the condition that b > 0, c > 0, bc > 1. Therefore, 
the equilibrium state x = 0 is asymptotically stable if b > 0, c > 0, and bc > 1. 
(Note that these are sufficient conditions for asymptotic stability.) 


Lur'e's method We present next a stability analysis of non- 
linear systems by use of the first canonic form of Lur'e. A. I. Lur'e has 
studied, by means of the second method of Liapunov, a class of systems with 
one nonlinear element. An advantage of this approach is that V functions 
have been developed for nonlinear systems in the canonic form. Major 
disadvantages of this approach are that it requires a rather tedious job of 
finding a solution to a set of algebraic equations, and that this approach can 
be applied to only a rather limited class of nonlinear systems. In order to 
save tedious computations for a solution to algebraic equations, various 


r=0 
for 20 K 


-5 -ü3 


+e ee 
$-À sr ka Sr, 


Mo: Simple poles in the left half 
plane excluding the origin 


Figure 8-6 Nonlinear control system. 
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simplified stability criteria have been developed in the literature (8-72, pp. 
324-26) which would facilitate the stability analysis of nonlinear systems in 
the canonic form of Lur'e. 

Consider the control system shown in Fig. 8-6. It is assumed that G(s) 
involves only simple nonzero poles in the left half plane. The input-output 
characteristic curve of the nonlinear element is assumed to be a continuous, 
single-valued function. The output of the nonlinear element is /A(e), where 
e is the input to the nonlinear element. It is assumed that (0) = 0. The 
nonlinear element may consist of a controller and other elements. Such 
a system having only one nonlinear element can be described in the 
first canonic form. In the system shown in Fig. 8-6, it is asumed that the 
input to the system, r(t), is removed at time t = 0 and r(t) = 0 for all ¢ > 0. 


The first canonic form of Lur’e The nonlinear control sys- 
tem shown in Fig. 8-6 may be described in the following first canonic form 
of Lur'e: 

X,=Mx, + Ae) “(G=1,2,...,”) 


n 8-32 
e= > QiXi ( ) 
i=l 


where X, = canonic variable 
e = input to the nonlinear element 
h(e) = output of the nonlinear element [the input to 
G(s)] 
A, = ith pole of G(s) (A; can be either real or complex) 
a, = negative of the residue at pole s = A, 


Notice that é can be given by 
= Z Cx e: rh(e) 
n 
where r=— Da, 
t=1 
For real X's the corresponding canonic variables become real. For complex 


(s+4) (s+5) 
(s+ 1)(s+2)(s+3) 


r=0 


Ale) = e? 


Figure 8-7 Nonlinear control system. 


^ 
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conjugate A's, the corresponding canonic variables are complex conjugate 
pairs. 

As an example consider the system shown in Fig. 8-7. Equations for 
this system in the first canonic form can be obtained as follows: Since G(s) 
can be written as 


(s + 4Xs + 5) 


MA CE RV CES) CE EY 
BI WM 1 
B Ex PRATER 
we obtain 
À, = —l, A, = —2, A= —3 
a, = —6, @ = 6; a;=-—l 


r=—(a@,+a,+a;)=1 
Hence the system equations in the first canonic form is given by 
x, = —x, + h(e) 
X. = —2x, + h(e) 
Xx; = —3x; + h(e) 
= —6x, + 6x; — Xi 


(8-33) 


where hle) = e 


By direct computation 
é = 6x, — 12x, = 3x; = h(e) 


Once the numerical values for the coefficients Ai, @;, and r have been 
calculated it is possible to obtain stability information of this nonlinear 
control system. 

It is convenient to write Eq. (8-32) in the form of the vector matrix dif- 
ferential equation as follows: 


x = Ax + A(e)y 


(8-34) 
e — z*x 
X — n vector 
À — n x n diagonal matrix whose diagonal elements 
are Aj, As, ... , Aw Where the A, are negative real 
quantities and/or complex conjugate pairs having 
negative real parts 
1 


1 


where 


y —n vector = 


1 | 


e — scalar, the input to the nonlinear element 
h(e) — scalar, the output of the nonlinear element 


d, 
Gs 
z — n vector — 


Then é can be given by 


é= Ze x, — rh(e) = z*Ax — rh(e) 


= ZG — rh(e) — x* A*z — rh(e) (8-35) 
Sufficient conditions for asymptotic stability of the origin of the system 


given by Eq. (8-34) can be stated as follows: 


Theorem 8-14 (Lur’e’s theorem) The origin of the system 
given by Eq. (8-34) is asymptotically stable in the large if 


Q)r =-% a; >0 
(2) h(e) is continuous for all values of e, eh(e) > 0 for all values of e ¥ 0, 
i he) de — co as e| —^ co and &(0) = 0 


(3) there exists at least one set of a,, a, . 


, An Which satisfies the 
following matrix equation: 


jp s Q1; 
2 r a, E 2 2 A5 + Ji + Q Ai 0 
Q30j 
iuis CHEER Toe |_| 0 (8-36) 


OF dy T FP 0 


where a; is real if X; is real, and a, and a,,, are complex conjugates if 
A, and X44, are complex conjugates. 
If the first condition r > 0 is modified to r > 0, then the origin is pe 
totically stable in the large if V does not vanish identically in ż Z2 0 for 
any X, # 0. 


Proof If there exists at least one solution to Eq. (8—36) satisfy- 
ing the condition that a, is real if à; is real and a, and a,,, are complex 


conjugates if >, and X,., are complex conjugates, then Eq. (8-36) can be 
rewritten as follows: 


2/r a— a Fie 


aid, 


x, 4 0 


24/ r a. a — 2 X XC + GÀ» = 0 (8-37) 


To rem 0 


Equation (8-37) can be simplified as follows: 
2/ r Y - 2Sy + A*z — 0 


where 
a dd _ 4010s d did, 
: à cr XX TX 
ds QoQ, oF DA dd; 
v= > S= Ag TM Ay + Aa Ag +X 
- x [PI^ = d, d» s; 
“s intu XX "MES 
Hence we obtain 
2/ r v* + 2y*S--z'A— 0 (8-38) 


Let us choose the following V function as a tentative Liapunov function: 
V(x, e) = x* Sx + r h(e) de (8-39) 
0 


It can be shown that this V function is positive definite. (Refer to Prob. 
A-8-27.) V possesses continuous first partial derivatives. V can be obtained 
by use of Eq. (8-35) as 
V = X*Sx + x* Sx + h(e)é 
= x*(A*S + SA)x + h(e)(y*Sx + x* Sy + x* A*z) — rh'(e) 
Noting that 
y*Sx = x*Sy and x*A*z = z*Ax 


!Notice that if, for example, A: and A» are a complex conjugate pair, then À1 = Xe, 
4| = d», and a, = d». Hence the first row of Eq. (8-36) can be written as 


24/r d» — 2È UIT +a = 0 


which is the second row of Eq. (8-37). Similarly the second row of Eq. (8-36) is identical 
with the first row of Eq. (8-37). 
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V becomes 
V = x* (A*S + SA)x + A(e)(2y* Sx + z*Ax) — rh'(e) 


Since 
did, did; +++ Aan 
did, a: +++ an 
A*S + SA = — 
anai dads +++ 4d, 
= negative semidefinite! 
we obtain 


x*(A*S + SA)x = —x*vv*x 
Hence V becomes 
V = — x*vv*x — rh*(e) + A(e)Qy* Sx + z*Ax) 
= — [v*x + V7 h(e)]*[v*x + Wr he) 
+h(e) (v r v*x + A/ r x*v + 2y*Sx + z*Ax) 
Noting that 
Wa x*y: 

we have 

V = — [v*x 4- A/r h(e)]*[v*x + /7 hle] 

+ h(e)(24/ r v* + 2y*S + z*A)x 

Using Eq. (8-38), V is simplified into = 

V = —[v*x + /7 h(e)]*[v*x + 17 hle) (8-40) 


From Eq. (8-40) we see that if r > 0, V is negative semidefinite but does 
not vanish identically on any solution curve. Hence the assumed V function 
is a Liapunov function. Moreover, V(x, e) co as ||x|| + |e|] —^ co. Hence 
if r >Q then by Theorem 8-5 the origin is asymptotically stable in the 
large. If r = 0 and V does not vanish identically in ? 2 0 for any x, + 0, 
then by Theorem 8-5 the origin is asymptotically stable in the large. This 
completes the proof. 

Notice that G(s) cannot have poles at the origin. The characteristic of the 
nonlinear element can, however, be quite general within the restrictions 
stated in the theorem. Notice also that this theorem gives only sufficient 
conditions for asymptotic stability of the origin of nonlinear systems defined 
by Eq. (8-34). 


1See Prob. A-8-27. 
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Example 8-13 Consider the system shown in Fig. 8-7. The system 


equations in the first canonic form are given by Eq. (8-33) and written thus 


žı = —xı + h(e) 
Xe = —2x: + h(e) 
X3; = —3x3 + h(e) 
where : e = —6x| + 6x2 — xs, hle) = e? 


In this system the equilibrium state is the origin and 
À; = —1, Ae = —2, A3 = —3 
a, = —6, Q: = 6, Qs = —1 

r= —(083- Q: + 03) — 1 


eh(e) = e zo 
[ he) de + 00 as |e| > co 


Equation (8-36) becomes 


= — Gi au ma) 

2a. 2( i a a) + 6 0 

2a, — 2( — && — € _ 66) 12 =| 0 (8-41) 
D asai did — i 

2a; 2( - ; 5 3E 3 0 


Let us check if there exists at least one solution to Eq. (8-41). Equation (8-41) 
can be simplified as follows: 


2S qu: 
a a2 mm as 
a(1+%4+%4%)=6 (8-42) 


Notice that obtaining a solution to Eq. (8-42) is rather time-consuming. [For this 
reason, various simplified versions of Eq. (8-36) are used for the determination of 
asymptotic stability.] After some computations we find that the following set of 
a’s will satisfy Ec. (8-42): 

a = —3.45, a = 5.83, a; = —1.40 


This set of a’s satisfies the condition that a; must be real if X; is real. Hence, by 
Theorem 8-14, the origin of the system is asymptotically stable in the large. 

Notice again that in any given nonlinear system nonexistence of a set of the a; 
that satisfies the condition does not imply any conclusions on stability, since 
Lur'e's theorem gives a sufficient condition but not a necessary and sufficient 
condition for asymptotic stability. 


i 
t 
| 
i 
i 
| 


8-7 STABILITY ANALYSIS OF DISCRETE- 
TIME SYSTEMS! 


In this section we present stability analysis of discrete-time Systems. As in 
the case of continuous-time systems, asymptotic stability is the most impor- 
tant concept of the stability of equilibrium states of discrete-time systems. 
If the system is described in terms of state variables, asymptotic stability of 
equilibrium states of the discrete-time system obtained by discretizing the 
continuous-time system is equivalent to that of the original continuous-time 
system. 


First we present stability theorems for discrete-time Systems based on the 
second method of Liapunov. 


Theorem 8-15 Consider the following discrete-time system 
x((k + 1)T) = f(x(kT)) (8-43) 
where X — n vector 


f(x) — n vector with the property that f(0) — 0 
T — sampling period 


" Suppose that there exists a scalar function V(x) continuous in X, such 
that 
(a) V(x) >0 for x 40 
(b) AV(x) < 0 for x ze 0 
where AV(x(kT)) 
= V(x((k + 1)T)) — V(x(kT)) 
V(f(x(kT))) — V(x(kT)) 


(c) (0) — 0 

(d) V(x) — oo — as||x|| > 00 
Then the equilibrium state x — 0 is asymptotically stable in the large and 
V(x) is a Liapunov function. Note that in this theorem the condition (b) 
may be replaced by 

(b) AV(x) <0 for all x 

(b") AV(x) does not vanish identically for any solution sequence {x(kT)} 

satisfying Eq. (8-43) 

This means that AV(x) need not be negative definite if it does not vanish 
identically on any solution sequence of the difference equation. (The maxi- 


mum invariant set is the origin itself. See Section 8-4.) 


! This section follows closely R. E. Kalman and J. E. Bertram (8-2). Numbers in paren- 
theses refer to the references listed at the end of this chapter. 


The proof of this theorem is quite similar to that of Theorem 8-1 
and is therefore omitted. 
The following two theorems apply to discrete-time linear time-invariant 
systems. 
Theorem 8-16 Consider the discrete-time linear time-invari- 
ant system 


x((k + 1)T) = Gx(kT) 


where x is an n vector and G is ann X n constant matrix. A necessary and 
sufficient condition for the origin x = 0 to be asymptotically stable is that 
all the eigenvalues of the matrix G be less than unity in absolute value. (Note 
that this is equivalent to saying that ||G*|| —^ 0 as k — co. We leave the 
proof of Theorem 8-16 to the reader.) 


Theorem 8-17 Consider the discrete-time linear time-invari- 
ant system 


x((k + 1)T) = Gx(kT) (8-44) 
where x is an n vector and G is an n X n constant matrix. The equilibrium 
state x — 0 is asymptotically stable if and only if, given any positive definite 
Hermitian matrix Q (or any positive definite real symmetric matrix Q), 
there exists a positive definite Hermitian matrix P (or positive definite real 
symmetric matrix P) such that 


G*PG — P = —Q 
Then 
V(x) = x*Px 
is a Liapunov function for the system of Eq. (8-44), with 
AV(x) = —x* Qx 


(The proof of this theorem is identical with that of Theorem 8-7, and is 
therefore omitted.) 

Notice that the Hermitian form x* Px or quadratic form x'Px is positive 
definite if and only if the absolute values of all the eigenvalues of G are 
smaller than unity. 

For discrete-time systems a norm ||x|| may be used as a Liapunov 
function. This fact is stated as a theorem in the following. Before we present 
the theorem, however, it is necesary to define a term contraction. A function 
f(x) is said to be a contraction if 


Tf£CGOII < Ixl] 
for some set of x ~ 0 and some norm, with the property that f(0) = 0. 
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Theorem 8-18 Consider the following discrete-time system 
x((k + 1)T) = f(x(KT)), f(0) = 0 (8-45) 
where x is an n vector and f(x) is also an n vector, elements of which are 
functions of the x;. Assume that f is a contraction for all x and some norm. 


Then the equilibrium state, the origin, of the system of Eq. (8-45) is asymp- 
totically stable in the large and one of its Liapunov functions is 


V(x) = |x| 


Proof If we choose V(x) = ||x|| which is positive definite, 
then AV can be written as 


AV(x(kT)) = V(f(x(kT)) — V(x(kT)) 
= |f|] — Iixll 
which is negative definite by definition of a contraction. Hence V(x) = ||x|| 
is a Liapunov function, and the epum state x = 0 is asymptotically 


stable in the large. 
We next present a theorem fora discrete-time forced dynamic system. 


Theorem 8-19 Ifall solutions of 
x((K + 1)T) = Gx(kT) 


where x is an n vector and G is an n X n constant matrix, tend to zero as 
k — oo, then all solutions of the following system: 


x((k + 1)T) = Gx(kT) + Hu(kT) (8-46) 


where H is an n x r constant matrix, are bounded, provided that the forcing 
vector u(kT), an r vector, is bounded. 


Proof Since u(kT) is bounded, there exists a positive constant 
c such that 


lluk T| < c for k 220; 1,2... 
The solution of Eq. (8-46) is given by 
x(kT) = B(KT)x(0) + (kT) 2 97 GT)Bu(G — DI) 


where S(kT) = 
Hence, 


S(kT)o-'(jT) = G*? 


IIx DII « IIG I ]IxC0)]] + c 2; lIGIF-^11RUI 


< IIGIIIxCOII + lim e Zlcir-mu 
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Since the origin of the free dynamic system is asymptotically stable, there 
exist constants m and r, (0 < r < 1), such that 


0 < |GE < mr* 
Then 


k 5 . k X 
lim Y |G] < lim X mrt- = m — 
k=æ jal k=æœ j=l 1—r 


Therefore 


lix Ge) < mill) + elf — 


We thus have proved that ||x(AT)|| is bounded. 


8-8 APPLICATIONS OF SECOND METHOD 
OF LIAPUNOV TO SYSTEM DESIGN! 


In a conventional way of designing control systems a control system is 
first designed and then its stability is examined. A different approach first 
formulates the conditions for stability and then designs the system within 
those limitations. For a large class of control systems a direct relationship 
can be shown between Liapunov functions and generalized performance 
indexes used in synthesis of optimal control systems. If the second method 
of Liapunov is utilized to form the basis for the design of an optimal con- 
troller, then: this insures an asymptotically stable system and the system 
output will be continually driven toward its desired value. 

In this section we first focus our attention on the relationship between 
Liapunov functions and transient responses of the system and then discuss 
the relationship between Liapunov functions and generalized performance 
indexes. 

The following discussion is based on the fact that Liapunov functions give 
a measure of the state of the system at any given instant, or they can be 
regarded as a measure of distance in the state space. For an asymptotically 
stable system for which a Liapunov function has been obtained, the latter 
can be used to estimate the rapidity of transient response. Let us assume 
that the equilibrium state is the origin of the state space and define 


_. Pn A) d 
t= (8-47) 
in some region of the state space excluding the origin. For uniformly asymp- 
totically stable systems, a Liapunov function with negative definite V can be 


found, at least in principle. In the following analysis we, therefore, exclude 


1 This section follows closely R. E. Kalman and J. E. Bertram, (8-2). 


the case where V(x, t) = 0 for x 0. Equation (8-47) gives an indication of 
how fast the system state approaches its equilibrium state. Integrating both 
sides of Eq. (8-47) with respect to t 
t 
V(x, t) = Vx» t)e te 
where V(X, to) corresponds to the initial value of V starting at x = x, at 
t= fy. If we write the minimum of the ratio — V(x, t)/ V(x, t) as nmin, OT 


— min| — Ves 2] (8-48) 
then 

Wx, t) € —nmin V(x, f) 
and 


V(x, t) S V(x, t;)e "ni t-t (8-49) 


Interpreting V as the distance from the origin, it is clear that Eq. (8-49) 
gives an estimate of how fast equilibrium is approached. 

From Eq. (8-48) we see that 1/»,;, corresponds to the largest time con- 
stant relating to changes in the Liapunov function V. Since V(x) may be 
expanded in series starting with quadratic terms in x, this time constant 
1/7 min is about half the conventional time constant defined for the system. The 
value of nmin thus can be considered as a kind of figure of merit. Larger 
values of 7min correspond to faster response. Since there may be a number of 
Liapunov functions for a given system, a number of different values of 7min 
may be obtained. The largest value of 7min should be considered to represent 
the figure of merit. It is, however, usually not known how to construct the 
Liapunov function which yields the largest value of 7. The value of ņ in a 
sense gives a measure of "goodness" of the Liapunov function for a given 
system, since this value can be interpreted as the upper bound on the time 
constant of the system. 

Since a Liapunov function for a given system may be obtained without 
solving the system differential equation, it may be possible to find this 
figure of merit for a nonlinear system whose solution cannot be found direct- 
ly. For a nonlinear system, 7 in may depend on the state of the system. 
However, this approach is useful only if the Liapunov function available is 
of a sufficiently simple form. 


Calculation of 7 for linear time-invariant systems (8-2) Con- 
sider the following linear time-invariant system: 


X = Ax 
where X = n vector (state vector) 
À — n X n constant matrix 


Suppose that the eigenvalues of A have negative real parts. Then a Liapunov 
function and its time derivative are given by 
V(x) — xX*Px and V(x) = —x*Qx 
where Q = —[A*P + PA]. A definition of min equivalent to Eq. (8-48) 
is then 
si = min {x*Qx; x*Px — 1} 

The minimization can be performed by use of the Lagrange multiplier 
technique. Let p, be the Lagrange multiplier. Then, minimizing x* (Q — uP)x 


with respect to x implies that the minimum occurs at a value X,,;, of x such 
that 


(Q — BP)Xmin = 0 (8-50) 

Therefore, | 
X" min QXinin = LX* min PXmin = y > 0 
which is a minimum if p is. But, by Eq. (8-50), w is an eigenvalue of the 
matrix QP-'. Hence 7min is the minimum eigenvalue Amin of QP^!, or 
Amin = Amin (8-51) 
The Liapunov function or V function can thus be used for the direct synthe- 
sis of dynamic systems. 
Example 8-14 Consider the second-order system: 
Xi 0 LEES en 
EMUPEPIN 


This system corresponds to a second-order system with the damping ratio £ — 0.5 
and the undamped natural frequency , = 1. Let us obtain an estimate of the 
largest time constant of the system by use of a V function. 

Let us assume 


V = x’Px 
where 
A/P + PA = —I 
or noting that P is symmetric, namely ps; = pi» 
Pee d 
t1 —ljipis pee Pi pesji—-1l —1 0 -—1 


from which we obtain - 


| 2 Ex 

Pu Ppi 2 2 
E 

pa P22 pl 1 
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Hence 
yV-xPx- Fle + x2)? + 2xi + oxi] 
and 
y = —xx- —(xi + x?) 
Thus, 
S o 2(xi + xi) 
7 Vi 3xi + 2xixi + 2x3 


Differentiating 7 with respect to xi 


On _ 4xixi — 4xixi — 4x2 
Ox: (3x1 + 2xixs + 2x3)? 


Putting é7/8x, = 0, we obtain 
xi = 1.618 x; or xi = —0.618 x; 
Substituting these two values of x; into the expression for 7 we obtain 
min = 0.553, max = 1-447 - 


and the upper limit on the convergence time is thus 


1 1 


EA 70333 ^ 1.81 (sec) 


This is an estimate of the convergence time for V(x). Since V(x) involves quadratic 
terms of x;, V(x) converges about twice as fast as x;(t) and thus the upper limit on 
the time constant for x(t) would be 


7wm 0.553 


Comparing this figure to the time constant of the given system 


1 
Tz =2 
j Con 
we find that the system time constant is less than the upper bound previously 
predicted namely, 


2 < 3.62 


Thus, we expect that possibly some other V(x) would give a closer estimate of the 
time constant. Notice that it is always not less than the actual system time constant. 

7» and Ymax can also be obtained from Eq. (8-51). Let us first find the 
eigenvalues A; of QP-', or solve 


[QP-! — M| = 0 
Alternatively, using I instead of Q solve for the eigenvalues A; of P-! or solve 


II —AP| =0 
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Jn the present example, solve 


e 
| 
e 


X 
The result is 
Ai = 1.447, A» = 0.553 


Hence 
Amin = 0.553 and Nmax = 1.447 


This solution, of course, agrees with the solution obtained by directly computing 
the maximum and minimum values of —V/V. 


We shall next discuss the relationship between Liapunov functions and 
generalized performance indexes. Suppose that the system is given by 


x= x 
where X — n vector 


A =n X n constant matrix whose eigenvalues 
have all negative real parts 


and the performance index we wish to minimize is given by 
i " x*(1)Qx(1)dt 
to 


where Q is a positive definite (or semidefinite) Hermitian matrix [or a posi- 
tive definite (or semidefinite) real symmetric matrix]. For the present system 
we can choose a Liapunov function and its time derivative as follows: 


V(x)=x*Px and  V(x)- —x*Qx 


where the positive definite Hermitian matrix P (or positive definite real 
symmetric matrix P) is determined from 


A*P + PA = —Q 


Then noting that x(co) = 0 and V(0) = 0 the given performance index can 
be written as 


f I x*(t)Qx(oOdt 

=— f V (x)dt 

= —V(x(oo)) + V(x(t9)) 
= V(x(tQ)) 


= x*(()Px(t;) 


Thus the given performance index equals the value of the Liapunov function 


V(x) = x*Px at t= t. This relationship between Liapunov functions and 
quadratic or Hermitian performance indexes is quite useful in synthesizing 
optimal control systems. Since we shall discuss this matter in Chapter 9, 
here we give a simple example illustrating the usefulness of Liapunov 
functions in the optimization of control systems. 


Example 8-15 (8-2) Consider the second-order system given by 
Xi 26%) +x= 0, € 2-0 (8-52) 


where £ is an undetermined constant. We wish to minimize the performance index 
V(x, £) defined by 


Va, 6) = [7 Axar 


where 
Q h 9] 220 
> Eä 0 aT hz 


Determine the value of £ so as to minimize the performance index. 
Equation (8-52) can be rewritten as 


or 


where 


Since Q is a positive definite (or semidefinite) real symmetric matrix we can choose 
à Liapunov function as follows: 


V(x) — x'Px with V(x) = —x'/Qx 
where the positive definite real symmetric matrix P is determined from 


A'P + PA = —Q 


Solving for P we obtain 


P(E) = 


E 
2 
t 


tu] 


Since V(x, £) is equal to x/(0)P x(0) 


V(x, £) = £xN0) + [7] [x2(0) + x3(0)] + 2 (0) x0) 


The value of £( M) which minimizes V when x;(0) = 0 is obtained as follows: 
Eol) = d 
from which 
Sop (0) = i 


This is a well-known result. This type of optimization (minimizing [s xi dr) yields 
à System which is too oscillatory. The integral squared error criterion has been 
known to yield insufficient damping and hence to be impractical. If, however, p is 
chosen to be unity, 


P 5 = 0.707 


This value of ¢ is considered to be quite satisfactory. Hence, we see that 


or minimizing 
f Gd + x3)dt 


may be considered a better error criterion than 


or minimizing 


8-9 CONCLUDING COMMENTS 


The preceding sections of this chapter have presented a reasonably detailed 
account of stability analysis of dynamic systems and have illustrated many 
Stability and design problems to which the second method of Liapunov 
is most useful. 

The second method of Liapunov proves to be particularly powerful in 
the investigation of stability analysis of certain nonlinear systems where 
conventional criteria cannot give answers. 

Stability analysis by the second method of Liapunov is simply a problem 
of finding a positive definite V- function (Liapunov function), whose time 
derivative, taken in the direction of the solution of a system, is nonpositive. 
The major problem in applying this method to nonlinear systems is the 
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construction of Liapunov functions. Methods presently available for con- 
structing Liapunov functions are applicable to relatively simple systems only. 
Ingenuity is still required to apply this stability criterion to systems of other 
than simple types. We should note again that Liapunov theorems on stability 
of nonlinear systems furnish only sufficient conditions. Failure to construct 
a Liapunov function for a given system does not imply instability. 

The second method of Liapunov has another useful feature in addition 
to stability analysis; it can be used for synthesizing optimal control systems 
in the time domain. 

Liapunov functions have been determined for numerous specific dif- 
ferential equations and for certain differential equations of general types.' 
The accompanying exercise problems and solutions include some examples 
of the specific systems for which Liapunov functions are known. 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-8-1 The closed loop transfer function of a single-input 
single-output system is given by 
C= K 
R x(t +s+1)s+2)+K 


where K is the gain of the system. By use of Routh’s stability criterion determine 
the range of K for asymptotic stability. 
Solution The characteristic equation is obtained as 
sé + 35° + 357+ 2s+ K=0 
The Routh array of coefficients is 


s 1 

g* 3 2 0 
s + K 

& a TK 0 

s? K 


For asymptotic stability, K must be positive and all the coefficients in the first 
column must be positive. Therefore, 


14 
g^K»20 


! Readers interested in a more detailed account of the second method of Liapunov should 
see (8-6). Those interested in stability (other than the second method of Liapunov) of 
differential eauation svstems should refer. for example. to (8-13). Readers interested in 
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(When K — 14/9, the system becomes oscillatory and mathematically the oscillation 
sustains at a constant amplitude.) 


Problem A-8-2 Determine the range of K such that the equilib- 
rium state of a linear system whose characteristic equation is given by 


A -KNLA -XATISO 
is asymptotically stable. 
Solution The Routh array of coefficients is 


At 1 1 1 
AN K 1 0 
X: E 1 0 
i! EGET 4 
K—1 
A? 1 
For asymptotic stability, the conditions are 
K>0 
K-1>0 
1-gE >0 


From the first and second conditions K must be greater than one. For K > 1 the 
term 1 — [K?/(K — 1)] is always negative, since 


KicK -tetka 
ker o Ee op 


Thus, thethree conditions cannot be fulfilled simultaneously. Therefore, there is no 
value of K allowing asymptotic stability of the equilibrium state of the system. 


Problem A-8-3 Show that the Routh’s stability criterion and 
Hurwitz stability criterion are equivalent. 


Solution If we write Hurwitz determinants in the following tri- 
angular form: 


23 


then the Hurwitz conditions for asymptotic stability become 
Ai = anan +++ Qi > 0 (P. 1,2,5: 3) 


which are equivalent to the conditions 


We shall show that these conditions are equivalent to 
a,> 0, by > 0, c, > 0, c.. 


where a;, bi, c1, . . . are the elements of the first column in the Routh array. 
Consider, for example, the following Hurwitz determinant: 


Q, A3 as A? 


The determinant is unchanged if we subtract from the ith row k times the jth row. 
By subtracting from the second row a/a; times the first row, we obtain 


Qi, Q3 as az 


Qoo 93, An 


A4 e 
a a3 as 
ao a» ay 
where a= a 
Qo» = Q — a a 
a 
ao 
d»; = d4— ~a 
23 EN Tia 
[ 
Qs, = Ag — — az 
Qi > 


Au Q5 Q5 a; 


A, = 
a 035 as 
0 âs Gas 
^. a 
where Âs = d — — às 
Qi 
a ao 
044 7 04 — — 45s 
a 


Next, subtracting from the third row a/a times the second row yields 


Qi 03 As Q0; 


N Q»» Q»3 Qu 
0 Q33 az 
0 âs du 


ai 
where 033 = a3 — ae an 


a 
ax = as — — 24 
a22 


Finally, subtracting from the last row â;3/az; times the third row yields 
an @3 as ar 


Q22 è Az Ang 


A; = 
0 0 Q33  Q34 
0 0 0 ass 
where Qa = Gas = Mis 54 
ü35 


From this analysis, we see that 
A4 = A11 022033444 
As = 1.422433 
Ao = dud» 
TAL = an 
The following Hurwitz conditions for asymptotic stability 
A, > 0, A: > 0, A; > 0, Ay > 0, 
reduce to the conditions 
ay 0, ax > 0, 433 > 0, a > 0, 


Referring to the Routh’s array, we see that 


ân = 

Qa — 09 — — 03.0 D 
a abi — aib 

as = a; — |a = et im =e 
Q22 bi 

A a beer — bic 

au = Âu — Bay, = SO t= dy 

433 C1 


Hence the Hurwitz conditions for asymptotic stability become 
a, > 0, bi 0, eci 0, dı 0, 


Thus, we have demonstrated that Hurwitz conditions for asymptotic stability can 
be reduced to Routh's conditions for asymptotic stability. The same argument can 
be extended to Hurwitz determinants of any order and the equivalence of Routh's 


stability criterion and Hurwitz stability criterion can be established. 


Problem A-8-4 (8-5) Show that the first column of the Routh array 


of 
N+ aA"! + aA"? +++ + Gnd + an =0 


is given by 


he iby A. 
ty Ar RO LIE or 
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where 
a 1 0 0 0 
ü3 Q9 ay 1 . 0 
Ar-—las a a a +++ 0 
A-1" . . LLLI ar 


a, =0 ifk>n 
Solution The Routh array of coefficients has the form 
1 a A4 dg *** Gn 
Qj dg dg + 
bi bz bs 


The first term in the first column of the Routh array is 1. The next term in the first 
column is a; which is equal to A;. The next term is b; which is equal to 
aia: — d _ Az 
Qa, Ai 


The next term in the first column is c; which is equal to 


aia — @3 Q1d4 — as 
————3|a;—a1| ——— 
bias — aibs a a 


bi EE — 2 


ai 


..Q1dsd5 — a3 — aia, + d1ds 
a,Q2 — A3 


In a similar manner the remaining terms in the first column of the Routh array can 


be found. 
The Routh array has the property that the last nonzero terms of any columns 


are the same, that is, if the array is given by 
Go az: Q4 ag 


Qa, Q3 as ar 


bi b: bi 
Cr Co C3 
di d; 

€1 €» 

f 

gı 
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then 


and if the array is given by 


Ci C2 0 


then 
ag = b; = di = f: 1 
In any case the last term of the first column is equal to an, or 
d, = An-14n = An 
Asi /— Ana 
Thus it has been shown that the first column of the Routh array is given by 
ho à. && E os em 
Problem A-8-5 Show that if X and Y are functions of u and v; 
u and v are functions of x and y, then 
OX, Y) Au, v) | WX, Y) 


alu, v) x,y) 905») 


where 
ox OX 
ax, ¥)_| 9x oy 
xy) | ay ay 
‘Ox dy 


AX, Y), u,v) _| u w| | Ox ay 
uo) Ox») | ay 2Y| | a 2» 
Qu æ | | ax Oy 
OXdu , OXdv 0XOu , 3X 3v 
ðu ðx Ovdx Oudy ' dv dy 
"| @Y@u , 0YOv 2YOu , 0YOv 
Gu dx" Ovex Gudy dv dy 
ax OX 


_|% @y | _ XX, Y) 
ay ay| xy) 


Problem A-8-6 Suppose that X and Y are functions of x and y: 

Show that if at least one of 0X/0x, OX|Oy, ƏY|əx, 0 Y|Gy is nonzero and 
ax, y) 
then there exists a functional relationship between X and Y, namely, there exists 
a function f such that either 
Y,y)—-f(X605,») or XG, y) =f(V, y)) 

Solution Without loss of generality we can assume that 0X/dx + 0. 

X = X (x, y) can be solved for x, or 


x = h(X, y) 


Then we obtain 
Y = Y(A(X, y), y) 
Thus X and Y are functions of X and y. Then 


o | [ax ax 
aX, Y) _ AX, Y) | HX y) _ ôx Oy 
ax, y) AX, y) Ox, y) ay oY x x 
aX ay 
AGE, OE 
y Ox 
Note that by assumption 
px Y d ax 
Sos. 0 and Du #0 
Hence we obtain 
oT. 
i 0 
From this we conclude that Y = Y(ACX, y), y) does not involve y. Hence 
Y —f(X) 


Problem A-8-7 A general solution F(x, y) = c, where c is a constant, 

of the following first-order differential equation 
y! — f(x, y) 

is called an integral of this differential equation. Prove that there exists a functional 
relationship between two integrals F and G of the same first-order differential 
equation: 

Solution Put 

F(x,y)=a,  GQy-b 

where a and b are arbitrary constants. Assume that y is a general solution of the 
differential equation corresponding to constants a and 5. Differentiating F and G 
with respect to x, we obtain 


Hence 


This equation holds true for an arbitrary general solution y(x) at point (x, »(x)). 
[If the initial condition is chosen properly, point (x, y(x)) can be an arbitrary point 
in some domain.] Hence from Prob. A-8-6, there exists a functional relationship 
between F and G. 


Problem A-8-8 Suppose that X and Y are functions of x and y, 
namely X — X(x, ») and Y = Y(x, y). Prove that if at point (xo, yo) 


eX, Y) 
Ox, y) 


then there exist functions F(X, Y) and G(X, Y) defined in the neighborhood of 
point (X(xo, yo), Y(xo, yo)) such that 


x= F(X, Y) y=G(X,Y) 


Solution By assumption we can assume 2 X/8x = 0 at point (xo, yo). 
Solving X = X(x, y) for x in the neighborhood of point (xo, yo) we obtain 


x ec AC) 


Then, Y = Y(A(X, y), y). Y is thus a function of X and y. Therefore, X and Y are 
functions of X and y; and X and y are functions of x and y. Hence, we obtain at 
point (xo, yo) 


#0 


0x 


a(x, y) AX,y) Ox, y) 2» ax 
from which we obtain 


oY 
5 ^? 


We can, therefore, solve Y = Y(h(X, y), y) for y and obtain y = G(X, Y). Substi- 
tuting y — G(X, Y) into x — A(X, y), we obtain 


X FX Y) 
This completes the proof. 


Problem A-8-9 (8-2) - Assume that x(t) is the solution of the follow- 
ing equation: 
X = f(x, u, t), —oo«rt«oo 


504 STABILITY ANALYSIS 
corresponding to a fixed choice of u. Show that for fixed u, deviations from the 
motion x(t) are a free dynamic system. 
Solution Let 
y=x+z 
then 
y=H=x+z 
= f(x + z, u, t) 
= f(x, u, £) + g(z, t) 
Since x(t) is a solution of the given equation it follows that 
z = g(z, t) 
g(0,t) = 0 for all t 
Hence, for fixed u, deviations from the motion x(t) are a free dynamic system. 
Problem A-8-10 (8-1) Consider the following system: 
X1 = aex, — xs — axj 
X. = x; + aex2 — ax} 
where a > 0 and e is a small positive constant. Show that the origin is unstable. 


The origin may, however, be regarded as "asymptotically stable" in the large, if 
we disregard small oscillations of amplitude less than /2e around the origin. 
Solution Near the origin the system can be linearized as follows: 

X1 = aexi — xo 

X2 = xi + aex: 
The characteristic roots of this linearized system are 

Ai = ae +j, Az = ae — j 
Since ae > 0, the origin is unstable. 4 j 
We can, however, show that the origin may be regarded as if “asymptotically 


stable” in the large if we disregard small oscillations (a limit cycle) within a small 
neighborhood of the origin. Let us take 


V= + 
then V can be obtained as 
V = —2a(xi + xf) + 2ae(x? + x2) 
Outside the circle U with its center at the origin and radius /2e ma have 
A/2e < Vx? + xi 


and 
ý< — 2a| et *x)- l6 x] = — a(x} - xt «o 


Hence outside the circle U, we have V > 0 and V < 0. All solutions starting 
from the outside of the circle U approach a limit cycle located inside the circle U 
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as t — co. Thus, the origin may be regarded as “asymptotically stable" in the 
large, if we disregard small oscillations of amplitude less than ^/2e around the 
origin. 
Problem A-8-11 Referring to Theorem 8-7 prove that the relation- 
ship 
A*P + PA = —Q 
is preserved under a transformation of coordinates. 
Solution Let 
x = Ty 
where T is an n x n nonsingular matrix, then 
y = T-ATy = Ayy 
where T"'AT—A, 
We thus have 


A = TA|T^! 

and 

A*P + PA = (TA, T-9)*P + P(TAi T7!) = —Q 
or 

T*(T-)*A;*T*PT + T*PTA, T-'T = —T*QT 
Therefore, 

AFP; + P:AÀ; = —Qi 

where Pi-T T*"eT and Qj; = TOT 


Problem A-8-12 Determine the stability of the equilibrium state of 


the following system: 
NE —2 s el 
| x | AL LANES EMI 


Solution The equilibrium state is the origin, or x = 0. Solving the 
following equation: 


Ec tb eid PEA 
[— 1:553, M Dis Poe Bis pzjl—1 +j —3 
-1 0 
-| 0 - j 
we obtain: 
4pu + (1 — jpu + (1 + pis = 
(1 — J)pu + 5pis + (1 — j)pz 


(1 + Jpu + Spi + (1 + J)p22 
(1 — J)pis + (1 + J)Pi2 + 6p2: 


(8-53) 


ll 
= O OQ = 


Notice that since pı, and ps2 are real, the second and the third of Eq. (8-53) are 
equivalent. They are conjugate to each other. From Eq. (8-53) we obtain 


O MC ee -2 
pu = -F> pre = ra +J) Pz = 4 


or 
3 1 ; 
* -4-Jy 
1 1 1 


which is positive definite. Hence we conclude that the origin of the system is 
asymptotically stable. 


Problem A-8-13 Show that if 
F(x) = F*(x) + F() 


where 
0 Ar ... OR 
Ox, Ox OXn 
F9-|: o: : 
afa Of» ... fn 
Ox, Ox» Ox, 


is negative definite, then the determinant of F(x) is nonzero for x # 0, or 
det F(x) + 0 for x zz0 
Solution By use of the relationship 
y*F(x)y = [y*F(x)y]* 
we have 
y*F(x)y = y*[F*() + Fooly 
= y*F*(x)y + y*FGOy 
= [y*F(x)y]* + y*FGOy 
= 2y*F(x)y 
Since F(x) is negative definite, F(x) is also negative definite. Hence 
det F(x) + 0 for x #0 
Problem A-8-14 By the use of Krasovskii's theorem (Theorem 8-13) 
prove that the equilibrium state x — 0 of the system defined by 
Xx; = —3xi + x 
Xe = X1— xs — xi 


is asymptotically stable in the large. 


Solution In this problem 


3x, + xs 
f(x) = ; 
Xy Xs —.x$ 
and 
FG) u 3 1 
wy 
1 —1—3xj 
Hence 


F(x) = F'(x) + F(x) 


a. 2 
oN - 


Since F(x) is negative definite, the equilibrium state x = 0 is asymptotically stable. 
In addition, 


f’(x) f(x) = (—3xi + x)! + (xı — x: — xi) S00 — as ||x!| co 
The equilibrium state is, therefore, asymptotically stable in the large. 


Problem A-8-15 (8-20) Prove that the function V(x, y), the solution 
to the following partial differential equation, 


SE AG») + FE Ax») = $e) — Vos y) (8-54) 


where (x, y) is a quadratic form and is positive definite, is a Liapunov function 
establishing the asymptotic stability of the origin x = y = 0 of the following 
system: 


x = fix, y) 


8-55 
y= f(x,y) "TM 


where the equations of the linear approximation are asymptotically stable. It is 
assumed that the solution to Eq. (8-55) is continuous for —co < t < co. (This is a 
simplified version of Zubov's method of construction of Liapunov functions.) 


Solution Because the roots of the characteristic equation of the 
linearized system have negative real parts, the leading term of the series expansion 
of V(x, y) is a positive definite quadratic form. Thus, V(x, y) is positive definite in 
some neighborhood of the origin and also V(0, 0) — 0. Equation (8-54) gives the 
derivative of V(x, y) along the integral curves of the system, Eq. (8-55), and since, 
near the origin, Eq. (8-54) reduces to 


we see that dV/dt is negative definite in the neighborhood of the origin. In fact, 
dV/dt is negative whenever 0 < V < 1. From Theorem 8-2 V(x, y) is a Liapunov 
function establishing the asymptotic stability of the origin. 
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Problem A-8-16 (8-2) E. A. Barbashin obtained a specific stability 
criterion for the third-order system given by the following equations: 
Xi X 
X2 = Xs i . (8-56) 
ž = — f(x) — g(x2) — aixs 


where f (0) = g (0) = 0 and f (xı) and g (x2) are differentiable. 
Prove that the equilibrium state x = 0 of the system is asymptotically stable in 
the large if 


(1) Q1 0 
Q) f(x) > 4 0 for x, #0 


3) e gon) — f'(x)»0  forx x0 


where 
f(x) = Se) 


and a Liapunov function may be written as 
V(x) = a, F(x) + f(x)xs + G(x2) + 4 (a Xo x3)? (8-57) 


where 
F(x:) zb f(x)dx and GOJE DE dixi 


Solution In view of (2), f’(0) > 0 and F(x;) > 0 for x; z 0. Fur- 
ther, by (1) and (3) it follows that G(x;) > 0 for x» + 0. For x» = 0, Eq. (8-57) is 
nonnegative and is 0 only if x; = x = 0. For x, 0, Eq. (8-57) becomes as 
follows: 


[2G(x2) + f(x1)x:]° 


S? seen [7 {algia — fad} xe dxe ax; 
a 0 


G(x2) 
(aix + xi 
— 


The first and third terms in the preceding expression for V(x) are clearly non- 
negative. The second term is positive because the inner integral is positive and 
F(xi) > 0 for x; z 0. Hence, 


Vix) > 0 for x #0 
and 
V0) = 0 
The time derivative of V is found as 


Vix) = — [8889 — ra] 
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Because of (3) 
Wx)<0  foralx-0 
V0) — 0 


It is clearly seen that V(x) does not vanish identically along any trajectory starting 
at xo Æ 0. [Since V(x) = 0 only if x» = 0, we see from Eq. (8-56) that V(x) identi- 
cally equal to zero corresponds to x; = x» = x; = 0.] Hence, Eq. (8-57) is a 
Liapunov function for the system of Eq. (8-56). In addition, we see that 


V(x)—9o as ||x|| > 00 


Hence by Theorem 8-5, the equilibrium state x = 0 of the system is asymptotically 
stable in the large. 
Notice that Eq. (8-56) is equivalent to 


Xi aX + g(xi + f(x) =0 


If we let f(x1) = a3x;, and g(xi) = a»xXi, then this last equation becomes as fol- 
lows: 


Xi + ağı + aži + ax = 0 
The three conditions for asymptotic stability are 
(1):a; > 0 
(2) aa >0 
(3) a1a.—a;>0 


(Note that these three conditions imply a, > 0.) Notice that these conditions are 
simply the Routh-Hurwitz conditions for asymptotic stability of the linearized third 
order system. 


Problem A-8-17 (8-20) By use of the variable gradient method con- 
struct a Liapunov function for the following system: 


X1 = — xı + 2x? x 

X) = — x 
Solution Let the gradient of V be defined by 
re + it 


2X1 + 2X2 


VV 


The derivative of V is then given by 
y -(VVyx 
= (auxi + ai xz)xi + (azn xı + 2xo)X2 
= —ayuxi + 2auxixs — aXX: 
+ 2auxixi — anxixs — 2xi (8-58) 
Let us put, as a trial 


a= 1 and a: = $1 = 0 


Then Eq. (8-58) becomes 
V = —x2(1 — 2xix) — 2x3 
V is negative definite if 
1—2x1x. > 0 (8-59) 
Thus, Eq. (8-59) is a constraint on x; and x». The gradient V V becomes 
x): 0 
0+ «d 
We see that the curl equation is satisfied. V is obtained as 


vr-| 


zi; (%2=0) Zn (xi—24) 
V= f xidxi + h 2x0 dxe 
xi 


2 


+ x3 


V is clearly positive definite. From this V function, we can say that the origin of the 
system is asymptotically stable. Figure 8-8(A) is a plot showing the region for 
which V is positive definite and V is negative definite. 


(A) (B) 


Figure 8-8 Regions for which V is positive definite and V is negative definite. 
(A) V cx (B) V i ax t 
In order to illustrate how the choice of ai, a1», and az; affects the resulting V 
function, let us choose 
2 =x xi 


cay (1— xix)" -— (1— xix») 


Then, 
V = —2x? — 2x? 
V is negative definite in the entire state plane. VV becomes 
2x1 Xi Xe 


Vv= a = xix)? E (1 m xix) 


It follows that 


xi (1— xi xy 
OVV; _ 3x} — xix 
Ox — (1 be! xs) 


Thus the condition for the curl equation is satisfied: V is obtained as 
Zi, (Z2=0) 2x l xix NONO E 
sae mte uer [e Ee ne Jos 
o (1—xx) (- xx) dxi4- á it — aa + 2x2 |dxs 
n 
= ]— X1X2 


+ x3 (8-60) 
From Eq. (8-60) V is positive definite if 1 — x,x. > 0. The region 

> Kies (8-61) 
includes the origin. From this V function we can say that the origin of the system 
is asymptotically stable. Figure 8-8(B) shows a plot of the region given by inequality 
(8-61). This is the region for which V is positive definite and V is negative definite. 


It is seen that the second V function gives a larger region, for which V is posi- 
tive definite and V is negative definite, than the first V function does. Hence 


is a better choice between the two V functions. 


Problem A-8-18 (8-21) Prove that the origin of the following sys- 
tem is unstable: 


X; = —2xixexs 
ONE X2 2 
X: = Xix. 
2 2135 1X5 
* Xa 2 
Xs = + xix 
EF ee 


where t > 0. 


Solution The equilibrium states of the given system can be found by 
equating X;, X», and X; to zero. Clearly, they are given by (xı, 0, 0), xı being any 
constant. We shall examine stability of the origin. After some inspection, choose 
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then 


Zi, (zi72,,2570) 


fF cvvydx = f" anit, «f 2xsdxs 


Lay (ximmmi—z) / ga] ) 
«f (Lec 2x dx; 


2(t + 1) 


= xt ba a + 14-3 


X2X3 


If we choose 


+ 2¢ + D + 1) 


"ug 4 


V-xictxibe zn 224+ 


t+1 


re rae +x) 


=x% + — (+h 


nu 


then V is positive definite for t > 0. Since this V function is time-varying, V is not 
equal to (VV)'x. V is given by 


y -fU.qvvys 
2 aj 44 
= [a++ t+ 8 axat xt + 1) 68 + 24) 
Em 0.5 2 o} 
=h E5 teta] 29 7 sd > Cee ta 


Thus P > 0 except on the x; axis where V = 0. By Theorem 8-6 the origin of the 
system is unstable. 


Problem A-8-19 (8-22) Find conditions for stability of the equilib- 
rium state of the following system: 


X1 = X. 
Xo = a(t)xı + ar(t)xe 
Solution The equilibrium state is clearly the origin. Let us assume 
v= i "E d 
boi xi + Das Xo 
then 
(VVYX = buxixs + bixi + bunai(t)xi + biaxt)x; 
+ byias(t) xi xo. + bogai(t) xi xo 
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Let us choose bı: = b; = 0, then 
(V Vx = buxiXxe + bzzas(t)xz + bos ai(t) xi xo 


Let 
bu + bos ait) = 


By choosing bı, = 2, and assuming a,(t) # 0, we obtain 


2 
b 2 = | on, = 
P air) 
VV can then be written as 
2xi 
VV = 2 
a(t) = 


Since 


ij (Z2=0) zu) — 2 
f 1 2xidxi + f x2dX_ = x} — 


1 
: a a" 


let us choose 


mw 
yes: a(t) H 
Then 
dt) . — 9€ š 
V= zait 70 I7 a 
= - (220 _ «s 
a(t) i) 


If a(t) < 0 for t > to, then V is positive definite. If 


_ 2a(t) , ai(t) 
ait) TM 0) <0 for t >to 


then V is negative semidefinite. Hence, by Theorem 8-3 conditions for stability 
may be obtained as follows: 


a(t) « 0 for t > to 


ppe 
_ 2a(t) , å:(t) 
a(t) * ax <0 for t > 1% 
Problem A-8-20 (8-23, 8-24) Consider the problem of stability of 
a homogeneous atomic reactor with constant power extraction. The reactor equa- 
tions are 


d = o— IL 
qos P= 2r alis 
s yt 


D 
[] 


where P(r) is the instantaneous reactor power, T(t) is the temperature, œ the temp- 
erature coefficient, 7 the average life of a neutron, and e the heat capacity. Equa- 
tions (8-62) can be rewritten as 


z = Le — 1) = f(a) 
These are obtained by applying the transformation of coordinates 
log P = 2, T-z 
to the system, Eq. (8-62). Clearly the origin z, = zə = 0 is the equilibrium state. 


Examine stability of the origin by constructing a Liapunov function by means of 
the variable gradient method. 


Solution Let the gradient of V be defined by 


vy = ien + vi 
02121 + 22 
The values of the a’s must be such that the following curl equation is satisfied: 
OV _ OVV2 
PES oz, 

or 

dan Q(aiszs) _ O(ao121) 

ma ee 7 
We may choose ai» = a»; = 0 and aj, = aii(zi). Then 
y -(vVvvyz 


= [au(z))] 21 (-4 z) +z lc — 1) 
If we choose 
= T A 71 
au(z)- ( Je 1) 
then, 
y= ae | 71 — 22, ZI ao = 
y- s Ut Dt D=o 


With this choice of a,,(z,), VV can be written as 


vy - |a - ? 


Zə 


V is found by line integration as follows: 


V =f” C z — 1)dz, + E is^" Zo 
ae 0 


0 


zi 


2 
Since 7 and e are positive quantities, if œ is positive, then 


=a- 1) + 


V>0 for Zi, 22 #0 
V=90 forz = ze = 0 


Thus V is a positive definite scalar function. We conclude that the equilibrium state, 
the origin, is stable. From this it follows that the solution P = 1, T = 0 of the 
system of Eq. (8-62) is stable for 7 > 0, e > 0, and « > 0. 


Problem A-8-21(8-10) Determine the stability of the origin of the 
following system: 


X= xe 
Žž = —x:— xi 
Solution An example of possible Liapunov functions is 
V = xt + xi + 2xixe + 2x} 
Its time derivative is 
V = —2xt — 2x} 
The origin of the system is clearly asymptotically stable in the large. 


Problem A-8-22 (8-20) Determine the stability of the origin of the 
following system: 


Xi = —2xi + 2x} 
Xo = —Xs 
Solution An example of Liapunov functions is 
V = 6x? + 12x3 + 4xixi + xi 
Its time derivative is 
7 = —24(x? + xi) 
The origin of the system is clearly asymptotically stable in the large. 


Problem A-8-23 (8-20) Determine the stability of the origin of the 
following system: 


ži = HX, + x: + x(x} + xi 


Xs = —X1 — X: + x(x xi) 
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Solution An example of possible Liapunov functions is 
Vi= x? + x} 
The time derivative of V is : 
V = —2xt — 2x3 + 2d + xb? 
Hence, the region for which V is negative definite is the inside of the circle 


defined by x? + x3 = 1. From this V function we can conclude that the origin of 
the system is asymptotically stable. 


Problem A-8-24 Determine stability of the origin of the following 
system: 


MENT e 2x» 
^*—0-xy + xy 


Solution An example of Liapunov functions for this system is 


2 
xi 2 
+x 
l+xi. °° 


and V is given by 
fe-. looo 4x 
+x (+x)? 
The origin of the system is asymptotically stable. 


Problem A-8-25 (8-10) Obtain conditions for stability of the origin 
of the following system: 


Xi = xX 
ž = —b(xi)a — a(xi, X2)x2 


Solution A Liapunov function for this system is 


V= 2 [be dx x1 
and its time derivative is 
V = —2a(x,, x2)x3 
Conditions for stability obtainable from this Liapunov function are 
2 E b(xi)xidxi -- x3 20, | a(xi, x) 20 
Problem A-8-26 (8-11) Obtain conditions for asymptotic stability of 

the origin of the following system: 

X1 = X2 

Xs = X3 


X3 = — A2X2 — ay X3 — f(x) x1 


STABILITY ANALYSIS 517 


Solution An example of possible Liapunov functions is 


aan 


p. 
p 


p 


Xa 
SU: a[ f(xi)xidxi- aao X1 X2 


2 
x 2 
+ (ai + an) + dsXiX3 + di Xo Xa + X3 


. (G2X1 , xX x3 M E 
= (5 irs F Js) + ay) | fx dxi 
LEE: 
; MEE i 
and its time derivative is 
V = — [asf(xi)xi + 2/63)xixs + aixi] 
Conditions for asymptotic stability obtainable from this Liapunov function are 


a 7 0, a> 0, f(x) > 0, Qd» — f(x) >0 


Problem A-8-27 Show that the following V function is positive 
definite: [Refer to Eq. (8-39).] 


V=x*S + fac )d 
= xXx T e)ae 
where 
f "e)de >0 le is defined by Eq. (8-32)] 


x = n vector 


__ a (00108 ... Uds 

Ai + An Ai + Ào Aa + An 
aaa 220202 ENT 

i= Xo + Ai Xe + M Xe + An 
ana; nas 55 


Aat — Aet NaF Na 


and each A; possesses a negative real part. 
Show also that 


A*S + SA 
is negative semidefinite, where 
^X 0 
Az 


Solution Since the second term of V is positive semidefinite, we need 
to prove that x*Sx is positive definite. Notice that since each X. possesses a negative 
real part, we obtain 


NEC NE ARE n Mt 
ENSE fae Yay e«t) dt 


Hence x*Sx can be written as 


x*Sx — D Y (āe (ane) Xi x, dt 


uA 
= (7X ace 80 X (cet) dt 
Oi=1 k=l 
= IN » aiet x; dt > 0 forx~0 
i=l 
=0 forx=0 


Thus x*Sx is positive definite. It follows that V is also positive definite. 
Next, we shall show that A*S + SA is negative semidefinite. Notice that 


^i 2. M aia M d.a 
MFA ^ MMC Arc XR 
A*S = — 
Aaa E An FA E Ty Tn 
Ai Xa An 
Te KI a AG. 
SA = — 
Ai = As 2A Jn » 
mt” kee (ORS i 
Therefore, we obtain 
QQ, Q,Q2 did, 
A*S 4 SA = 428, dod» d» 
Gna, d,» ae An Qn 
Let us put 
Q = —(A*S + SA) 
Then, 
x*Qx = È È Giar)sixe = |X (x)|t2 0 


Hence Q is positive semidefinite. Therefore, A*S + SA is negative semidefinite. 


Problem A-8-28 Consider the system governed by the difference 
equation 


x((k + 1) T) = f(x(k T), u(kT)) 


Assuming that u is a constant vector, or u — c (where c is a constant vector), find 
the equilibrium state x,. 


Solution The equilibrium state has the property that if the initial 
state is x(0) = x,, then repeated iterations of the transformation do not result in 
any change of state. Hence, 


x(nT) = x, 
for all n > 0. Thus x, can be found from 
X, = f(x., c) 
Obviously, the equilibrium state is a function of c. 
Problem A-8-29 (8-2) Consider the system 
x((k + 1)T) = H(x(kT)(kT) 


Assume that there are positive constants c;, c», . . . , c, such that either 
n 
(a) max { M £t s(x) j Zi for all x 
z i jz21€j 
or i 
m Ci 
(b) max (X. huc} «1 for all x 


Show that in either case H(x)x is a contraction for all x and therefore the equili- 
brium state of the system is asymptotically stable in the large. 


Solution In case (a), define the norm by 
\|x|| = max feilxil) 


then 


IIHGOxI = max (e A G0x,) < max{ È £l 9| eit) 


< max { $ 3 lh} m max (c;|x;]) < max tcilx;l) 


= |Ix!| 
which verifies that H(x)x is a contraction. 
In case (b), define the norm by 


Ixi = X exi 
Then 
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n n n n 
IGOxI| = È e| X hess] < E È ol esl 
ii j=l íz1Cj jz1 


< max{ X ct h|} X cjlx;| < Soules 
= |įxl| 
which yields the desired result. 
Problem A-8-30 Consider a discrete-time system 
x((k + 1)7) = Gx(kT) 


where x is an n vector and G is an n X n constant matrix. Show that the scalar 
function x*Px where P is a positive definite Hermitian matrix (or a positive 
definite real symmetric matrix) is a Liapunov function, if P — G*PG is positive 
definite. 


Solution Let 
V(x) — x*Px 
Then V(x) is positive definite and AV(x(kT)) is given by 
AV(x(kT)) = x* (kT)G*PGx(kT) — x*(kT)Px(kT) 
= —x*(kT)Qx(kT) 


where Q = P — G*PG = Hermitian (or real symmetric) matrix 
Hence if Q is positive definite, 


AV <0 for all x #0 
Thus, V(x) — x*Px is a Liapunov function if P — G*PG is positive definite. 
Problem A-8-31 Consider the following system: 
X — Ax, xO = ce 
where x is an n vector and all the eigenvalues of A have negative real parts. Evaluate 
the following performance index: ri 
J= f x*Qxar 


where Q is a constant positive definite (or positive semidefinite) Hermitian or 
real symmetric matrix. 
Suppose that the system equation is given by 


Xi 0 1 0 Xi 
X2 |= 0 0 1| | xe 
X3 —1 -2 -—ajlix: 


and the performance index is given by 
J= a x'xdt 
0 


Determine the value of the parameter a (where a > 0) so as to minimize the per- 
formance index J when the initial condition is given by 
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xi(0) 
x(0) = c = 0 
0 


Solution Let 
we d (yx 
x*Qx = a Px) 


where P is a constant positive definite Hermitian or real symmetric matrix. 
Then we obtain 
x*Qx = —x*Px — x*Px = —x*A*Px — x*PAx 
Hence 
A*P + PA = —Q 
By the second method of Liapunov we know that if A is stable, then for a given Q 
there exists a P that satisfies this equation. Since the eigenvalues of A have negative 
real parts, A is stable and we can determine the elements of P. J can be evaluated as 
Js f x*Qxar = —x*Px|" 
= —x*(co)Px(co) + x*(0)Px(0) 


Since all the eigenvalues of A have negative real parts, x(co) — 0. Therefore, we 
obtain 
J = x*(0)Px(0) = c*Pc 


To determine the parameter a in A of the third order system, we first determine 
P form 
A'P + PA = —I 
Noting that P is symmetric, we have 
0 0 -1)[pn pm pis pu pi ps 0 I A 
1 0 —-2||ps po pz3|+|p2 pe pz 0 0 | 
0 1 -—ajipms ps ps pis pes psl]bi—1 —2 —a 


2 0 0 
= 0 —1 0 
0 0 —1 


Notice, however, that 
J= c'Pc 
pu pu ps|[xi(0) 
—-[x(0 0 0] pi? po? pz 0 
Pis puo P33 0 
= pixi(0) 


Hence we need know only pıı. By a simple computation pi; is found as 


—G Sa I 
l zn 4a 2 
Therefore we obtain 
a+ 5a— 1l 
* 4da—2 1(0) 
To minimize J we determine a from 
dJ 4d? — 4a— 6 _ 
Rt ©. eee 
from which 
a = 1.823 or —0.823 


Notice that d?J/da? > 0 if a = 1.823. Hence the optimal value of a (where a > 0) 
that minimizes J is 1.823. 

The eigenvalues of A designed in this way have negative real parts because the 
designed system satisfies Liapunov’s theorem for asymptotic stability. To confirm 
this let us use Routh’s stability criterion. A can be written as 


0 1 0 
A= 0 0 1 
=i —2 14.823 


Hence the characteristic equation becomes 


[A — AI] = —As —1.8234? —2..— 1 =0 
By constructing the Routh array of coefficients 
Wd 2 
AUS qigase "I 
x 2145 90 
A ok 


we confirm that the eigenvalues of A have negative real parts. 


PROBLEMS 


Problem B-8-1 Consider the following equation: 
At + 13A? + 59A? + 107A + 60 = 0 
Determine the number of roots whose real parts are greater than —1. 


Problem B-8-2 Suppose that X and Y are functions of x and y. 
Prove that if there exists a functional relationship between X and Y, then 
a(x, y) 


Problem B-8-3 Construct a Liapunov function for the following 
linear time-invariant system: 


Xi = Xo 
X» = —6x| — 5x: 
Problem B-8-4 Consider the system 
Ži = X2 
Xo = — 0 Xi + Worx. 


where € > 0 and o, > 0. By use of the second method of Liapunov prove that the 
origin of the system is unstable. 


Problem B-8-5 Consider a Hermitian matrix A and the character- 
istic polynomial 


lA — AIJ = (=) IA" — BA! + BAN? + --- + (—)*5] 


Prove that a necessary and sufficient condition that A is positive definite is that all 
the b; (i = 1, 2,..., n)-be positive. 


Problem B-8-6 Consider the system 
x = A(t)x + B(x, t) (8-63) 


where x is an 7 vector and A(t) is continuous and bounded for t > to and B(x, t) 
is continuous for x near the origin and for £ > to, and 


IB, Dl]. o 
ixi 


uniformly as ||x|| O0 for ¢ > to. If all solutions of the following equation 
x = A(t)x 
approach the origin as pape, the same is true for all solutions of Eq. (8-63). 
Problem B-8-7 Consider the following system: 
X + a(x? + 1)X + (x + bx) = 0 
a 0, b>0 
Show that the origin is asymptotically stable in the large. 
Problem B-8-8 Consider the system 
x = f(x), f(00—0 


Prove that if the eigenvalues of the Jacobian matrix J(0), where 


Sud uL A 
Ox, ôx: OXn 
a=) sp É : 
fn fn .., Ofn 
Ox, Ox» x, 


have negative real parts, then each solution which initiates near the origin must 
approach the origin as 1 > co. 
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Problem B-8-9 Determine the stability of the origin of the following 
system: 
Xue xs — xxi T xi) 


X) —X1 — X: — X(x? xj 


Problem B-8-10 Determine the stability of the origin of the system 


K 
(s+s,) (s+55) 


shown in Fig. 8-9. 


* = 
J Ale) de 20 for all e#0, ....2(0)=0 
o 
5,70, S,> (0) 
Figure 8-9 Nonlinear control system. 


Problem B-8-11 (8-2) Consider the discrete-time system 
x((k +- 1)T) = Gx(kT) 


where x is an vector and G is an n X n constant matrix. Prove that the eigenvalues 
of a constant matrix G are less than p in absolute value if and only if, given any 
positive definite real symmetric matrix Q the linear equation 


p'G'PG-—-P--Q 
has a unique, positive definite real symmetric solution P. 
Problem B-8-12 Consider the discrete-time system 
x((k + 1)7) = Gx(kT) 
which is obtained by discretization from the linear continous-time system 
x = Ax 


If the equilibrium state of the continuous-time system is asymptotically stable, 
then show that the equilibrium state of the discrete-time system is also asymptot- 
ically stable. 


Problem B-8-13 Determine the stability of the equilibrium state of 
the following system: 
x(k + 1) = x(k) + ax«(K) 
x(k + 1) = —axi(Kk) — (a — 1)xXK) — axi(k — 1) 


here L danatac tha Lth comnlina time ond n ic 5 ^nnctont 
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Problem B-8-14 By use of a specific example show that although 
the output of a discrete-time system at sampling instants may be bounded, it does 
not imply that the output for tọ < t < co is bounded. (Note that since the effect of 
unstable poles can be canceled by the sampling operation, the output at sampling 
instants can be finite but growing oscillations may exist between sampling instants.) 


Problem B-8-15 Prove that the maximum (minimum) value of the 
Hermitian form x*Ax, subject to the condition that x*x — 1, is equal to the greatest 
(least) eigenvalue of A. É 
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Optimal Control Systems 


9-1 INTRODUCTION 


Recently a number of techniques for designing optimal regulators and 
controllers have been proposed in the literature. In this chapter a direct 
method and a method using the functional equations technique of dynamic 
programming are presented for analytical design of optimal control systems. 
The plants to be considered in this chapter are linear and stationary and the 
performance indexes considered are quadratic or Hermitian. All.control 
processes considered in this chapter are of finite duration and deterministic. 
The optimal control problem here i$ to find the control vector u(t) such as to 
have the state vector x(t) as close as possible to some ideal; for example, in 
the regulator problem to have x(t) as close to some constant vector as 
possible. 

In practical optimization problems we must consider certain constraints 
on the allowable value of the control vector u(t). In some cases the control 
vector is subjected to saturation. In other cases the energy available for 
control may be limited. In this chapter optimal control systems are designed 
with constraints on the energy available for control. Mathematically the 
constraint on the available energy can be included in the performance indexes. 

Consider a control system which is represented by the following state 
space equation: 


X = Ax + Bu, x(0) =c (9-1) 
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where x — n vector (state vector) 
u =r vector (control vector) 
A= n X n constant matrix 
B=n xr constant matrix 


Equation (9-1) gives the relationship between the state vector x and the 
control vector u. The problem is to steer the state vector x from an initial 
state c to some final state or region in the time interval 0 < ż < T. In doing 
this we wish to minimize the performance index which is given by 


J(e, T) = J,(x(T)) + 3 J(x, u, t) dt (9-2) 


where J,(x(7T)) is a function of x(T) and J,(x, u, ¢) is a function of x, u, 
and f, by choosing the optimal control vector u(t) in the interval 0 < t < T. 
[Specific expressions for J,(x(T)) and J,(x, u, ż) are given later in Sections 
9-2 to 9-5.] Notice that the minimum value of the performance index is a 
function only of the initial state c, the time duration T, and the control vector 
u. In Eq. (9-2), J; is evaluated at the terminal state X(T) and J, is evaluated 
over the actual trajectory. In this chapter we limit these functions to be 
quadratic or Hermitian. 

In general, different performance indexes lead to systems with different 
types of behavior. What kind of performance index leads to the most desir- 
able system is largely a subjective question and cannot be answered without 
reference to a specific application. 

Itis a well known fact that the problem of minimizing the performance 
index for the general case is usually difficult to carry out and leads to a 
nonlinear feedback system whenever the performance index is of neither 
quadratic nor of Hermitian form. Thus, for nonquadratic or non-Hermitian 
performance indexes the solution may have to be obtained by numerical 
techniques by use of a digital computer. The possibility of solving the problem 
analytically whenever the performance index is of quadratic or Hermitian 
is a major advantage of the quadratic or Hermitian performance indexes. 

Note that optimal control systems to be designed in this chapter based 
on given performance indexes are optimal for all initial conditions if all state 
variables can be measured. Note also that although the plants considered in 
this chapter are liaear time-invariant, it will be seen in the following sections 
that if the duration of time in the control process is limited the optimal con- 
trollers become time-varying. (Hence such optimal control systems are time- 
varying even if the differential equation of the plant is time-invariant.) 

This chapter is organized as follows: In Section 9-2, we present an ana- 
lytical technique for designing noise-free optimal regulators. For given 
performance indexes we obtain by the direct method the optimal control 
vectors and the minimum values of performance indexes. Section 9-3 
presents the functional equations technique of dynamic programming. We 
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apply this technique to the analytical design of noise-free optimal regulators. 
Section 9-4 discusses analytical techniques for designing minimum error 
regulators. Finally, Section 9-5 presents analytical design of minimum error 
controllers with polynomial inputs. 


9-2 ANALYTICAL DESIGN OF NOISE-FREE OPTIMAL 
REGULATORS 


In guidance and orbit transfer problems, minimization of system terminal 
errors is important. Such a problem is usually called a noise-free optimal 
regulator problem. In this problem the system is initially at a state c which 
is different from the desired state. It is required to bring the system state as 
close to the desired terminal state, the origin of the state space, as possible by 
applying a control vector to the plant. The optimal design procedure can be 
applied to either a finite control interval 0 < ¢ < T or infinite control inter- 
val 0 < t < oo. (This is in contrast to classical design methods which require 
that the control interval be infinity.) In this chapter we assume that the 
control interval is 0 < ? < T. In solving this problem we formulate it as 
that of a minimization problem, that is, the problem of minimizing, for 
example, the terminal error 


x*(T)x(T) = |x (T)? + |x(T)* + --- + 1x (D) 
In this section we shall assume that the energy for control is specified, or 


T 
Í u*(t)u(t) dt = constant 
0 


By use of a Lagrange multiplier à, the constraint on the available energy can 
be included in the performance index. The performance index for this system 
can then be written as 


Ke, T) = x*(T)x(T) +à Í : u*(du(t) dt 


The technique of the Lagrange multiplier is widely used for problems of this 
type, where one condition is to be a minimum under other restrictive con- 
ditions. In fact, the conditions need not be in integral forms, and any func- 
tional or differential relation among variables can be handled in a similar 
manner. 

The problem here may be restated in a general form as follows: Given 
the plant governed by the following state space equation: 


X = Ax + Bu, x(0)=c (9-3) 
where X =n vector (state vector) 
u =r vector (control vector) 
A =n X n constant matrix 
B =n xr constant matrix 


and the performance index defined by 
Ke, T) = x*(T)Px(T) + » f " u*(ryu(r) dt 
0 


where P = (p;;) is a positive definite (or positive semidefinite) real symmetric 
or Hermitian matrix such that pu = p; if x, and x; are a complex conjugate 
pair, and X is a Lagrange multiplier which is a positive constant indicating 
the weight of control cost with respect to the minimizing errors, find the 
optimal control vector u(t) which minimizes the performance index. Notice 
that the elements of P depend on how the terminal errors are to be weighted. 
In this section we shall solve this problem by the direct method. 

Note that in obtaining numerical solutions it is often advantageous to 
simplify the state space equation into a simpler one where the matrix A is 
in the Jordan canonical form. Notice that a quadratic performance index in 
terms of a real state vector may then become a Hermitian performance 
index with respect to a new state vector, if itis complex. Note also that if a 
state vector of a physical system is a complex vector, then any complex 
elements of the state vector always occur as complex conjugate pairs. 
Throughout this chapter equations for optimal control of linear systems are 
derived for the case involving complex vectors and complex matrices. 
Therefore, the notation of conjugate transpose appears frequently in 
equations. We shall, however, occasionally use the notation of transpose in 
particular examples if the system considered involves only real vectors and 
real matrices. 


Solution of noise-free optimal regulator problem by the direct 
method (9-7) The solution of Eq. (9-3) is given by 
t 
x(r)-—e" le + f e-^ Bu(s) as| 
0 


Let us denote by J,(c, T) and x,(t) the J and x corresponding to a special u, 
where u, will be specified later. Since 


xr) = e^ [e + f e-**Buy(s) ds] (9-4) 
we have 
x(t) = x«(t) + f eA“ B[u(s) — uo(s)] ds 
Then J(c, T) can be expressed as 
J(e, T) = Ide, T) + x3(T)P | : e^7-? B[u(s) — uj(s)] ds 
+ (tu) uG)* Br ^79) dsPx (T) 


1Figures in parentheses refer to material listed at the end of the chapter. 


T Ir e* 7-9 B[u(s) — u(s)] &|'»if. e*7-? B[u(s) — u,(s)] ds 


+ xf [u(7) — uo(t)]*[u(t) — u?)] dt + X f : uz (r)[u() — ur] dt 


+A f. fut) — w(Ol*u() di (9-5) 
We shall define u,(t) by 
Au,(t) = —B*(e*?-%)*Px,(T ) (9-6) 


Then the second and the sixth terms on the right-hand side of Eq. (9-5) can- 
cel each other. Similarly, the third and the last terms on the right-hand side 
of Eq. (9-5) cancel each other. Hence 


He, T) = Ae, T) + X f^ tu — uDU — ue) d 


+ [fe^ Biu(s) — u) as |" P [ f7 eA?» Bu) — col ds | 


(9-7) 
Since X > 0 and P is positive definite or positive semidefinite, Eq. (9-7) 
indicates that u, gives the minimum value of J(c, T). Thus, u, is the optimal 
control vector. From Eq. (9-4), we have 


x(T) = UE +f ; e-^'Bujs) ds | (9-8) 
Substituting Eq. (9-6) into Eq. (9-8), 
x(T) = e*t|e — L f eM BBH(eAt-9)*P ds x(T)| 
from which 
x(T) = È +i * e^ T-9 BB*(e^tT-?»)*p ds| “ete (9-9) 
(The indicated inverse always exists.) Substituting Eq. (9-9) into Eq. (9-6), 
Anjt)- —B*(e^v-»y pr +4 É g^-9 BB*(e^-»y«p ds| e^ 
uj(7) can then be written as 
u(t) = —Bre^e-oepba + f e^ C-9 BB*(e^ T-2)*p ds| "ere 
or 
wf) = —B*(e^?-9)*P AE + ^ e^ BB*(e™)*P dr] e"e (9-10) 
In particular, we have at t = 0 
u(0) = —Br(eA7)*P| AI + fe BB*e^*P ds| ee (9-11) 


Eq. (9-11) gives the optimal control vector at t = 0. 
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Note that Eq. (9-10) specifies the optimal control law u,(t) as a function 
of the initial state of the plant. This means that Eq. (9-10) gives an open 
loop optimal control law. In controlling a plant by use of the open loop 
optimal control law, the control vector for the control duration is prepro- 
grammed on the basis of the initial state of the plant, x(0) — c. Open loop 
optimal control is optimal only in the absence of the disturbances. 

In general, the feedback optimal control law where the control vector is 
specified as a function of the present state of the plant is better than the open 
loop optimal control law. By means of feedback optimal control, we can 
utilize the most up-to-date information on the state of the plant. In the 
presence of external disturbances and plant parameter variations, optimal 
control or nearly optimal control of a plant is possible only by feedback 
optimal control. 

Let us derive the feedback optimal control law from Eq. (9-10) or Eq. 
(9-11). This is possible by the following procedure: Let T — t be the duration 
of time remaining in the process and x(t) be the current state. Equation 
(9-11) can then be modified into 


I z "———À— 1-1 
u(t) = —B*e^*-^ Pa + É e BB*(e^)*P ds | e^ c-0 x(t) 
(9-12) 


[Note that in the noise-free optimal control system, x(t) = x«(1).] 


Optimal 
Controller 


(A) x (O) 


Optimal 
Controller 


Figure 9-1 Block diagrams for optimal control systems. (A) Open loop optimal control 
system; (B) feedback optimal control system. 
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Equation (9-12) specifies the optimal control vector u,(/) as a function of the 
present state x(t) and the remaining time in the process, T — t. Thus Eq. 
(9-12) gives the feedback optimal control law. Figures 9-1(A) and (B) 
illustrate an open loop optimal control system and a feedback optimal con- 
trol system, respectively. [In the present optimal regulator problem the 
input &(r) is zero for 0 < t < T] 

Consider the control law defined by 


u(t) = —F(T — t) x(t) 


where 


T-t -1 
KT — 1) = B*(e^v-opba + f eM BBH(e™)*P as | enero (9-13) 


Clearly, this control law is the optimal control law for the present regulator 
problem. Notice that the explicit form of the optimal control law is deter- 
mined by the matrix F(T — t). The block diagram of the control system 
based on this law, the feedback optimal control law, is shown in Fig. 9-2. 
The optimal control vector is a linear function of x(t), because the perfor- 
mance index is of quadratic or Hermitian form. The time-varying element 
F(T — t), the optimal regulator, can be considered as time-varying gains. 
If the measurements of the state variables and also T — 1, the time remaining 
in the process, are readily available and reasonably accurate, the optimal 
control system thus designed will exhibit high performance. 


Figure 9-2 Optimal control system. 


One elementary implementation of the time-varying gain is as follows: 
If the remaining time T — ¢ is available as an angular shaft position, the 
time-varying multiplications can be realized from a set of loaded poten- 
tiometers. The signal on the wiper of such a potentiometer would then be the 
appropriate function of the shaft angle T — ¢ times the excitation voltage 
applied to the loaded pot. The shaft angle T — t is achieved by placing an 
input signal T — ¢ to a position servo. 

Next, we shall obtain J,(c, 7), the minimum value of J(c, T). Define 


fc, T) = J«(c, T) 


Then by use of Eq. (9-6) and Eq. (9-9), we find 
T 
fle, T) = Jc, T) = xt(T)Px«(T) + a f už (uj) dt 
= x3(T)Px(T) + È xg(T)P f! e^? BB*(e^T-0)*P dt A(T) 
0 
E xr +4 e^- BB*(e^ ?-^)*P dr (T) 
0 
1 T -1 
=t I T x PÍ e^ BB*(e**)* as| Pec 
0 


» -1 
— pea + [^ e BB*e^9*P as| er]e (9-14) 
0 
Thus, f (c, T) is found to be a quadratic or Hermitian form in c. In terms of 
the remaining time T — t and the current state x(t), the minimized perfor- 
mance index can be given by 


f(x), T — 1) 
= x*(r) perma T fc e^ BB*(e^*)*P ds| e^] 
(9-15) 


The optimal control law, Eq. (9-12), previously obtained is based on the 
assumption that the exact information on the present state of the plant is 
available. In practice, however, this may not be the case. If disturbances are 
present, we must treat the problem from a stochastic viewpoint. If the state 
variables of the plant are not all accessible for direct measurement, optimal 
control of the plant must be accomplished by use of the best approximation 
R(t) of the actual state x(t). The result in this case is, of course, not optimal 
but is nearly optimal. 


Question of stability (9-2, 9-3) It is necessary to make sure 
that the optimal control system designed is physically realizable. This can be 
assured if a Liapunov function satisfying the conditions of asymptotic 
stability exists for the optimal control system, since existence of such a 
Liapunov function means that the equilibirium state of the system is asym- 
ptotically stable and the system can be physically realized. 

In the present problem, if P is positive definite, then f(c, T) is a Liapunov 
function. This can be seen as follows: From Eq. (9-14) the expression for 
fe, T) is always positive if c +0 and is zero only if c = 0. Hence f(c, T) 
is positive definite. That df/dt is a nonpositive function of time may be seen 
from the physical meaning of f(c, T). From Eq. (9-14) we see that, once the 
initial condition c is given, f(c, T) is fixed. Since the control energy spent 
along the trajectory for any time interval 1, && t & 1, or 


| 


f uf(t)u,(t) dt 


is always a positive quantity, the function f(c, T) along the trajectory always 
decreases with time, or 


Ho itex0 

df|dt becomes zero only if c = 0. Hence df/dt is negative definite. We thus 
see that f(c, T) is a Liapunov function satisfying the conditions for asymp- 
totic stability. This is true for any T > 0 including T = co. We can now con- 
clude that the optimal control system designed in this way is always 
asymptotically stable for 0 < t < T. 

So far we have been concerned with the system behavior for the time 
interval 0 < 7 < T and nothing is said about the system behavior for t > T. 
The system behavior for t > T is determined by the control vector u for 
t > T. If we simply use 


u(t) = —F(0) x(r) 


for t > T, then the stability of the system for t > T is determined by the 
eigenvalues of the coefficient matrix of the following system: 


. x = [A — BF(0)|x 


If the eigenvalues of A — BF(0) possess all negative real parts, then the system 
is also asymptotically stable for t > T. Otherwise, a stability device which 
does not go into action until the final instant of the transient period is 
needed to avoid running away from the final state x(T) after the duration of 
the control process is over. 


Determination of Lagrange multiplier à The value of the 
Lagrange multiplier X is determined from the constraint on the energy 
available for control. If the total energy for control is specified, namely 


T 
f u*(r)u(r) dt = K = constant 
0 


then the value of X can be determined by substituting Eq. (9-10) into the 
preceding constraint equation and solving for X. Clearly, X is a function of 
the initial state c. Note that as the amount of the available energy is decreased 
the value of X is increased. (Hence, X is sometimes interpreted as the “price” 
of the energy. The more limited the available energy, the higher the price.) 

Note that in some cases the amount of energy available is not limited, 
but we wish to minimize the sum of x*(T) Px(T) and the cost of total energy 
spent in the process; namely, minimize the following J(c, T): 


Je, T) = x*(T)Px(T) + u f j u*(r)u(r) dt 
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where yu is the known price of the energy per unit amount. Then, this problem 
can be solved in exactly the same way as the problem just solved which 
involves the Lagrange multiplier X. In this chapter the optimal control vector 
is obtained in terms of A, which is either given (in which case X = u = known 
quantity) or can be determined once the total energy available for control is 
specified (in which case X is a Lagrange multiplier). 


Example 9-1 Consider the control system shown in Fig. 9-3. The 


Optimal 
Controller 


Figure 9-3 Control system. 


input r(£) to the system is removed at t = 0, or r(t) = 0 for t> 0. The performance 
index for this system is given by 


(e, T) = x (T) T) + rf" u'(Du(t) dt 


where x and u are real and X is a positive constant. The time interval 0 Cr «c T 
may be chosen any duration during which the command input to the system or 
external disturbances are zero and during which the plant to be controlled moves 
from one essential level of operation to another. It is desired to minimize the 
performance index. Obtain the optimal control function u(t) and the minimized 
performance index f(c, T). ‘ 

The system differentia] equation may be written in the form of the followingstate 
space equation: 


X = Ax + Bu, " 


^b Pls] 


From Eq. (9-12) the optimal control function u(t) with P = I is given as follows: 


x(0) =c 


where 


T-t -1 
ust) = —B(eAt-9y/ + f eB (ey ds | e^T-0x(r) 


For this problem e^? can be easily obtained as 
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The integral expression becomes 


fc emere = f i j|; |e nl E 
o 9 {0 LLI s 


(r-o (T-0 


3 2 
(T j ty T-t 
Thus 
T-t "C 
[ar + [ eA’ BB’ (e^*)' ds] 
0 
Aue LU 
zi JJ hee 
A) (r-m r-0| 
ume A+ 3 
where 
cai 
A=N+NMT-H+4uT-9 +759 
Hence 


ut) = —B'(e^T-9YZ-1e^T-0x(f) 


= Qm - À) + T — 9/2) + + MT — HF + KT — 0/3] 
A + AMT — D) + AKT — 2/3] + KT — 10/12] 


The minimized performance index f(c, T) is given by 
T -1 
se T) = e*DuehyDa $ L e^ BB'(e^ry ds | Ate 


_ MA + T)c? + AAT + (T?/2)Jerce + [A + AT? + (T3/3)]c3} 
= A +AT + A(T3/3) + (T4/12) 


Figure 9-4 shows families of curves f(c, 7) with three different combinations of 
T and X. It is seen ir: Fig. 9-4 that, for a given T and a given value of f(c, T), increas- 
ing the value of X from 1 to 5 decreases the area enclosed by the f(c, T) curve. 
This indicates that the allowable initial magnitude of the error state is decreased. 
Hence, the effect of increasing X for a constant T is to increase the terminal error. 
This may seem rather obvious because, as X is increased, the terminal error is 
penalized less severely since the constraint on the energy for control is weighted 
more heavily. The effect on the terminal error of increasing T for a constant X is a 
decrease in the terminal error. It is clearly seen in Fig. 9-4 that for a constant A, 
areas enclosed by f(c, T) curves are increased as T is increased. 

Figure 9-5 is a block diagram of the optimal control system. The system involves 
time-varying gains. The block diagram of Fig. 9-6 shows a conventional control of 


fie l)=4 


7-1 T=! 
Azl =5 


f(c,5)-4 


Figure 9-4 Families of curves f(c, T). 


AHAT- + ig-D 
^ 


M(T=1) z (7-7)? 
^ 


4 
A X AU-D + AC nh. UD 
Figure 9-5 Block diagram for optimal control of plant defined by X, = u. 


the plant 1/s?. The control system designed by the conventional method is a linear 
time-invariant system and the control function u assumes the form u = —ax, — bx», 
where a and b are constants. The compensating element of the linear time-invariant 
system can be a simple RC circuit, whereas the optimal control system designed in 


Figure 9-6 Block diagram for conventional control of plant defined by 1/s?. 


this.section is time-varying, and the essential control unit has time-varying gains 
which are much more complicated than an RC circuit. Although the control system 
designed by the conventional method may be satisfactory in performance, the con- 
trol system designed here which involves time-varying gains is guaranteed to give 
the optimal performance under the present performance index. 

Note, however, that a decision on whether to build the optimal controller or a 
simpler but inferior controller depends on a measure of the degree to which the 
performance of the optimal system, if realized, exceeds that of the simple system. 

Finally, we shall examine stability of the optimal control system designed. The 
system designed is asymptotically stable during the time interval 0 << : < T, the 
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duration of the control process, since the minimized performance index f(c, 7) is 
a Liapunov function. After the end of the duration of the control process, if the 
following control law is used 

u(t) = —F(0)x(1), t> r 
then 


Since the matrices A and B are given by 


hme! 


the matrix A — BF(0) becomes as follows: 


0 1 
A — BF(0) — 1 
a 


The eigenvalues of A — BF(0) are 0 and i, Hence the origin (the equilibrium 


state) of the system is not asymptotically stable for t > T. Therefore, we need 
some device preventing drifting away from the final state x(7) after the duration 
of the control process is over. 


Example 9-2 Consider the control system shown in Fig. 9-7. This 
is an example of control of an unstable system. The input r(t) to the system is 
removed at ¢ = 0, or r(t) = 0 for t > 0. The performance index is given by 


He, T) = x'(T)x(T) + A u'(t)u(t) dt 


where x and u are real and X is a positive constant. Find the optimal control func- 


tion u(t). 


r=0 


Optimal 
Controller 


Figure 9-7 Control system. 


The system differential equation is 
k= ax u, a>0O, x(0) =c 
Hence A = A = a and B = B = 1. The optimal control function u(r) is given by 


T-t -1 
— B'(e^ -oya + r e43 BB'(e^*)' as] e^7-0 x(t) 


2 
= -IF aan eTo),  0«:«T 


uc(t) 
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2a 


-20 (T-t) 


1+ (20-1) e 


Figure 9-8 Block diagram for optimal control of plant defined by x = ax + u. 


Figure 9-8 shows configuration of the optimal control system. The minimized per- 
formance index f(c, T) can be found as follows: 


f(c T) — eMe» + T e^' BB'(e^*) ds] ete 


= 2aX c? 
I + (an — 1)je-?«? 
This is a Liapunov function. We can easily confirm this as follows: We may write 
f(x(t), T — t) in the following form: 


ftt) T — 0 = remy x07, O0<t<r 


Since a > 0, X > 0, and 1 + (2aX — 1)e-?e(T-0 > 0 we see that f(x(t), T — t) is 


positive when x(t) z 0 and is zero only if x(t) = 0. Hence, it is positive definite. 
The time derivative of f(x(t), T — t) is found as follows: 


2 
"IT Qao pererenp iO 

df/dt is thus a nonpositive function of time and is zero only if x(t) = 0; df/dt is thus 
negative definite for 0 < £ < T. Thus, f(c, T) is a Liapunov function for any posi- 
tive value of X. The system designed by the present method is asymptotically stable 
for 0 « t < T (where T can be infinity) and hence it is physically realizable. At 
t — T, the end of the control process, however, the system may run away from the 
final position x(T) as time goes on unless the system designed is asymptotically 
stable for t > T. 

If we assume that the time-varying term would remain constant at the value at 
t — T, namely, 

u(t) = — F(0)x(t), fo 

then we obtain 


uibs e x xD, t>T 


The system differential equation for t > T is then given by 


z=(a-1)x 


———— P emma 


M MM T MH — M áÓ—Ó 


For X < 1/a, the optimal control system is asymptotically stable and no separate 
stability device may be necessary at the end of the process, t = T. However, for 
X > l/a, a separate stability device is necessary in order to avoid running away 
from the final position x(7). 


9-3 FUNCTIONAL EQUATIONS TECHNIQUE OF 
DYNAMIC PROGRAMMING 


This section presents application of the basic functional equations technique 
of dynamic programming to the solution of optimal control problems. 
We shall see that in determining optimal decision rules for continuous con- 
trol systems by the functional equations technique of dynamic programming, 
we must solve partial differential equations which must be satisfied by the 
function representing the minimized performance index. The partial dif- 
ferential equation is derived from a recurrence relation obtained by applying 
the principle of optimality to the minimized performance index. The principle 
of optimality due to Bellman states! *An optimal policy has the property 
that whatever the initial state and initial decision are, the remaining decisions 
must constitute an optimal policy with regard to the state resulting from the 
first decision." The mathematical transliteration of this principle yields the 
functional equations we shall need in solving the minimization problem. 
Consider the control system defined by Eq. (9-1), rewritten thus: 


X = Ax +- Bu, x(0) — c 


where X — n vector (state vector) 
u =r vector (control vector) 
A =n X n constant matrix 
B-—n*xr constant matrix 
and the performance index defined by Eq. (9-2), rewritten thus: 


He, T) = J,(x(T)) + jd Y x, m, £) dt 


Our problem is minimizing the performance index J(c, T) by optimally 
selecting the control vector u(t) in the time interval 0 < t < T. We clearly 
see that the problem here is that of the variational problem of minimizing 
J(c, T) subject to Eq. (9-1) with constraints on the control vector u. By the 
functional equations technique of dynamic programming, we imbed this 
problem within the family of problems generated by allowing c and T to 
be parameters. 

Let us define the minimum value of J(c, T) over all allowable u as 
f(c, T), namely 


fle, T) = min J(e, T) (9-16) 


lFor detailed discussions of dynamic programming see (9-4, 9-5). 


We may interpret f(c, T) to represent a cost of a control process of duration 
T, the initial state being c, and an optimal control policy being used. Let the 
optimal solution evolve from state c at time 0 to state c + Ac at time Ar. 
At the new state we see that the same problem exists and, from Eq. (9-16), 
the variational scheme gives 


fle, T) = min E +f : j at| 


At T 
zin . quin i Tdi J + fi Jae) 


u(0CtcAt) u(AtCtcT) 


I 


min lj r J, dt + fle + Ae, T — ro] (9-17) 
where c + Ac is given by x(At), since 
x(At) =e + S At — e + Ae 


The minimization is taken over all allowable control vectors u defined over 
0 < t < T with x(0) = c. Equation (9-17) is the basic recurrence equation 
for the solution of the problem. Expanding Eq. (9-17) into a Taylor series 
assuming continuity of the partials, 


fc; T) min [44 u, £)At + f(c, T) + Ac Z L A63 L 4 


at - A e] 
t+ Ae Ator t 1 


Canceling f(c, 7) on both sides of the equation 


of _ [ xu Aaaf x, Bede 
ar oT min axa, DAt + (e ci a At acs T3 


a ar 


At óc, a )At ae 1 


Dividing both sides of the equation by Ar and then letting Ar approach 
zero, we obtain 


of = min EG u(0), 0) + (VA)*[Ac + Bu(0)]| (9-18) 


where (Vh)* = (Vf = (grad f) = E of dis 2. 


and 
Aci 
At 


Ac + Bu(0) = lim 


AZ 
At 
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Equation (9-18) is the basic partial differential equation satisfied by the 
minimized performance index. The control vector u which minimizes the 
right-hand side of Eq. (9-18) is the optimal control vector. Equation (9-18) 
forms the basis of the operation of the optimal controller. It must be solved 
either analytically or numerically in order to determine the optimal control 
vector which minimizes the performance index. 


Solution of noise-free optimal regulator problem by the functional 
equations technique of dynamic programming (9-5, 9-6, 9-7) | Consider the 
plant governed by the following state space equation: 

x = Ax + Bu, x(0)— c (9-19) 
where x =n vector (state vector) 
u—r vector (control vector) 
A =n X n constant matrix 
B=n Xr constant matrix 


and the performance index 
Ke, T) = x*(T)PX(T) + 2 f’ u*(2uC) ae 


where P = (p;,;) is a positive definite (or positive semidefinite) real symmetric 
or Hermitian matrix such that p;; = p;; if x, and x; are a complex conjugate 
pair, and X is a Lagrange multiplier which is a positive constant. Our prob- 
lem here is to find the optimal control vector u(t). Since the’ problem is to 
minimize J(c, T) over all allowable u, define the minimum of the performance 
index as f(c, 7), or 


f(c;T)-— min [x*(T)Px(T) + » f utut) dt| 
f(c, T) can be rewritten as 


f(eT)- min min [> rt u*(t)u(t) dt 


uiocteat) u(Atct«r) 
+ x*CPXC) +A Í i u*())u(n) at] (9-20) 
Equation (9-20) can be approximated by 
fle, T) = min [a f 4 u*(t)u(t) dt + fle + Ac, T — rol 


The partial differential equation that must be satisfied by f(c, T) can be 
obtained as 


2. = min [rur(O)u(0) + (V*tAc + Bu(O)] (9-21) 


where (VA)* = (Vf) = (grad fY 
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Note that if c is a real vector, then VA — Vf. If c is a complex vector, then 
Vh = V f. Note also that we can write 


Xu*(0)u(0) + (VA)*[Ac + Bu(0)] 
1 B * * B * 
-=> Dac) +> va] Dac) +> va] 


+ (VA)*Ac — (VD gp: Vh (9-22) 


Let us denote the optimal control vector, the control vector u(0) that mini- 
mizes the left-hand side of Eq. (9-22), as u,(0). Then, clearly the optimal 
control vector u,(0) is given by 


u,(0) = — i B* Vh (9-23) 


Substituting Eq. (9-23) into Eq. (9-21), 


d ue (- lp. va) (— j B* vi) 


fe (Va| Ac + B(— = Bt vi) 


which becomes 


-A - x (Vh)* BB*Vh + (Vh)*Ac 


Since the performance index is quadratic or Hermitian, a quadratic or 
Iiermitian solution of the form 


fc, T) = c* S(T)c 
is assumed. (In Section 9-2 we have shown that f(c, T) is actually of a quad- 
ratic or Hermitian form.) Since f(c, T) is a real scalar, f* = f, or 


c*S(T)e = fle, T) = f*(e, T) = c*S*(T)c 


Thus we can take S(T) to be real symmetric or Hermitian. By use of the 
relationships 


f Be and (VA ^ (Vf) — 2c*S 


we obtain 


= e[- lsBB*S + SA + A*S|c 
or, rearranging, 


" E — SA — A*S 4 X SBB*S]c esl) 


This last equation is satisfied if S satisfies the following matrix Riccati 
equation: 


dS 1 
gr — SA — A*S + + SBB*S = 0 (9-24) 


For the solution of Eq. (9-24), let 
S(T) = HT) (T) 
where Y(T) and Z(T) satisfy the following equations 


dY _ _ 1 pp* 
T^ AY + X BB*Z 
dZ _ 
T^ A*Z 
Since the solution of the differential equation 
dx 


gj; AX + Bu, x(0) — c 


is given by 
X(T) = efe + f eX Bu(s) ds 
the solution for Z(T) is obtained as 
Z(T) = (e^")* Z(0) 
and the solution for Y(T) is 
Y(T) = e-^Y(0) + f e-^9-» L BB*Z(s) ds 


= Levbxo 4. f e^ BB*(e^')*Z(0) as] 


The solution S(T) requires Y-'(T). Hence inverting ¥(T) 
T -1 
Y-(T)- XX) mi f e^ BB*(e^:)* Z(0) ds] e^t 
0 
The initial conditions are determined as 
fe, 0) 2 c* Pc = c*S(0)c 
Hence, 
S(0) =P 
Also we have the relationship S(T) = Z(T)Y-'(T) which yields 
S(0) = Z(0) Y-'(0) 
Let us choose Z(0) — P and Y-'(0) — I so that Y-'(T) exists. Then 
S(T) = Mew)" Pba +f " e^ BB*(e4#)* P ds| en 
0 


It can be seen that S(T) is real symmetric or Hermitian. f(c, T) is obtained as 
fe, T) = c*S(T)c 
P ald 
=“ E SES + f e“BB*(e")*P as] enle (9-25) 

0 
The equation of the optimal control vector u,(0) is given by Eq. (9-23), re- 
written 

u,(0) = — >- B* Vh 
Noting that 
Vh = 2S(T)c 


the optimal control vector u,(0) becomes 
=d pe 
u,(0) = x. B 2S(T)e 
= —B*(e^ Pha + ['e*'BB*(e^*P ds| e^ (9-26) 
0 


Letting T — t be the duration of time remaining in the process and x(t) be 
the current state, the equation of the optimal control vector becomes 


u= —B*(e^v-»y* p| AI +f E e^ BB*(e^)* P ds | e^o x(t) 


which is identical to Eq. (9-12). The minimized performance index can be 
given by 


f(x(t), T — t) = x*()S(T — t)x(t) 
= x*(r) (xe^ e Plat + d e“* BB*(e4*)* P ds | ene?) x(t) 


which is the same as Eq. (9-15). The solution thus obtained by use of the 
functional equations technique of dynamic programming is, as a matter of 
course, the same as that obtained in Section 9-2 by the direct method. 


9-4 ANALYTICAL DESIGN OF MINIMUM ERROR 
REGULATORS 


In this section we consider the problem of selecting the control vector from 
an allowable set so as to minimize a function of the system error vector over 
the duration of the process. The system error is the deviation from the ideal 
path. The deviation from the ideal can be measured in many different ways. 
In this section the system error is defined as the difference between the de- 
sired state which is the origin of the state space and the instantaneous state 
during the control process. In what follows we shall consider the case where 
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we wish to minimize the generalized quadratic or Hermitian error, subject 
to the constraint that the energy for control is limited. 
Assume that the differential equation of the plant is given by 
š = Ax + Bu, x(0)=c (9-27) 

where X — n vector (state vector) 

u =r vector (control vector) 

A=nX n matrix 

B=nxr matrix 


In the present regulator problem the system has no input for ż > 0. (The 
system is subjected to only the initial condition.) The desired state through- 
out the time interval 0 < ż < T, where T is the duration of time in the 
process, is the origin of the state space. Thus the system error vector is the 
negative of the state vector. The performance index is given by 


Ke, T) = x*(T)Px(T) + [x*Xaxo) de + f URO) dt (9-28) 


where P — positive definite (or positive semidefinite) real sym- 
` metric or Hermitian matrix such that Pit pj; if x; 
and x; are a complex conjugate pair 
Q(t) = positive definite (or positive semidefinite) real sym- 
metric or Hermitian matrix such that qu = q;; if x; 
and x; are a complex conjugate pair 
R(t) = positive definite real symmetric or Hermitian matrix 


The matrix P determines how the terminal errors are to be weighted. The 
matrix Q(t) determines how the instantaneous errors are to be weighted 
during the control process. The matrix R(t) represents the weight of control 
cost with respect to the minimizing error. In the following material, this 
problem will be solved by the method based’ on the functional equations 
technique of dynamic programming and by the direct method. 

(1) Solution of minimum error regulator problem by the func- 
tional equations technique of dynamic programming (9-6, 9-7) Let f(c, T) 
be the minimized performance index. Then from Eq. (9-28) we obtain 


f(c, T) = min J(e, T) 


= min [x COPXCT) ET f x*(DQU)x()) dt + r u*()R(Du() at| 


Splitting the interval of integration into two parts 0<t< At and 
At < t < T, and applying the principle of optimality, we obtain 


fle, T) = min | [> Pex) + w*CORGw() dt 
+ fle-+ Ae, T — AD) 
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from which we obtain, after expanding f(c + Ac, T — Ar) into a Taylor series 
and dropping f(c, T) from both sides and dividing the remaining terms by 
At, 


sitis | am * v» Of Aci Of 
0 — min [x Qx +u Rit 3. 3279 ap + AD) 


In the limit as At approaches zero, this last equation becomes 


a = mia |e* Qc» + u*(0)R(O)u(0) + (VA)*[Ac + Bu(0)] (9-29) 


where (VA)* = (Vf) = (grad f) 
The right-hand side of Eq. (9-29) becomes minimum when 
2R(0) u(0) + B* Vh = 0 


The control vector u(0) satisfying this equation is the optimal control vector. 
Therefore, denoting the optimal control vector as u,(0) we have 


u,(0) = — E B* Vh (9-30) 
Substituting Eq. (9-30) into Eq. (9-29) 
a = c*Q(0)e + (VA)*Ac — L(VE)-BR-(0)B* VA — (9-31) 


Assume the following quadratic or Hermitian form for the solution 

f(e, T) = e*S(T)c (9-32) 
where S(T) is real symmetric or Hermitian. From Eq. (9-31) and Eq. (9-32) 
noting that VÀ == 2S(T)c 


d Are = e*[Q(0) + SA + A*S — SBR-((0)B*S]e (9-33) 


For a solution of Eq. (9-33) find S which satisfies the following equation: 
& —SA — A*S + SBR-(0)B*S = Q(0) (9-34) 
This is a nonhomogeneous matrix Riccati equation. The solution of Eq. 
(9-34) may be obtained by letting 
S(T) = Z(T)Y-(T) 


and solving the set of matrix differential equations: 


a” = — AY(T;) + BR-XT — T,)B*Z(T)) 
dZ(T;) 


dT, 


(9-35) 
= Q(T — TYY(T) + A*Z(T) 


—— MÀ — M —— 


Consider the casé where Q and R are constant matrices. Define 


WT) = E and M = Ue 
Z(T) Q; «a 
Then Eq. (9-35) becomes 
daw(T) _ T 
TT CMWT) (9-36) 


The initial condition for this matrix differential equation is obtained as 
follows: Since 


f(c, 0) = c*S(0)c = c*Pc 
we obtain 
S(0) — P 
Noting that 
S(0) = Z(0) Y-'(0) 
we select Z(0)— P and Y(0) — I so that the inverse of Y(T) exists. Thus 
the initial condition of Eq. (9-36) is given by 
I 
W(0) — H 
The solution of Eq. (9-36) is then obtained as 
W(T) = e""W(0) 
If we express e"? in the following 2n x 2n matrix 


eM? = Is l, ®,(T) =n x n matrix 


(i,j = 1,2) 
then, 
Eure IE — 
Z(T) (T)! (T) LP 
or 
Y(T) = (T) + ®.(T)P 
Z(T) = ®2,(T) + &s(T)P 
Hence 
S(T) = A(T)Y-\(T) 
= [®.(T) + 9,(T)P] {®i(T) + 9,(T)P]" 
and 


uj(0) = —R“B*S(T)c = —R^B*Z(T)Y (T)c 
= —R^ B*[O,(T) + $,(T)P] [D (T) + B(T] 'c 


f(c, T) is obtained as 
Ke, T) = c*S(T)e = c*Z(T)Y-(T)e 
= c*[B.(T) + ,(T)P][dSu(T) + O,(T)P]-"c (9-38) 
Let T — t be the duration of time remaining and x(t) be the current state. 


Then the optimal control vector [noting that (T), ®,.(T), (T), and 
®.,,(7) are functions of T] becomes 


u(t) = —R^!B*S(T — 1)x(t) = —K(T — 1)x(2) (9-39) 


where 
F(T — 1) =R'B*[@.,(T — t) + (T — D)P][O.(T — t) + (T — HPI! 
OTT 
and 


JEA), T — t) = x*(0)S(T — x(t) = x*()ZT — Y(T — t)x(t) 
= x*(t)[®.,(T = 2) T (T — t)P)[®1,(T — t) + (T — DP]^'x(f) 
The block diagram of the optimal control system is shown in Fig. 9-9. The 
system designed (if |P| > 0) is asymptotically stable during the process of 
control, 0 < t < T, since f(c, T) is a Liapunov function of the system. As 


for stability of the system for t > T, the analysis made in Section 9-2 also 
holds true in this case. 


Reference Input 
=0 for 7/20 


Figure 9-9 Optimal control system. 


(2) Solution of minimum error regulator problem by the direct 
method (2-7) The solution to Eq. (9-27) can be obtained as 


x(s) = ele ES f e-^' Bu(t) dt| 


Denote by J,(c, 7) and x,(s) the J and x corresponding to a special u,. 
The control vector u, will be specified later. Since 


X(s) = ende -d- f e~“*Bu,(t) ar| 


we have 


x(s) — xo(s) = [eA Bruto) — uw?) dt 
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Define y(t) = x(t) — x,(t). Then 
Je, T) = Ae, T) + y"(TIPMT) + f YOOO dt 
+f tuo — WORO) — wD] dt + xB(TIPY(T) 
Y COP) + | OOOD at + [YOK 


+ [E WORO) — w(0) dt + f"tutr) — uCOI* Ru Xt 
(9-40) 
Since 
x*(T)Py(T) = xi(T)Pe**| e-^'B[u(7) — u(2)] dt 
and 
f xi (5)Q(s)y(s) ds 
= fixe! f’ eA Bu) — wo) de] ds 


= [ (J; x Qe ds) e~ BMC) — wi dt 


t 


where we have used the relationship 


[a (( f ect )as = J AT. f(s)ds) g(t)dt 


the sum of the last six terms of the right-hand side of Eq. (9-40) can be 
written as 


JÉ U*totuto — wu) de + [^ tut) — woo 
where 
U(t) = B*(e-^*)*(e^7)* Px,(T) 
+ B*(e7^5* f (e^*QGyxsGS) ds + Ruli) 


We shall determine u,(t) by solving the equation U(t) = 0. With this u,(t) 
the sum of the last six terms of the right-hand side of Eq. (9-40) is zero. 
Noting that P and Q(t) are positive definite or positive semidefinite and 
R(t) is positive definite, we see that u,(t) gives the minimum of J(c, T). 
Hence u,(t) is the optimal control vector. It is given by 


u(t) = -RNB e-^) (e^? Px(T) 


+ f 0*9») as] (9-41) 
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Since 
ù l 
x(s) = e^9-P x (T) — J e^6-? Bult) dt (9-42) 
we have 


x(5) = | e^e-n + f" e^v-o BR-()B*(e-^)* dr (e^*P |xT) 


+f : e^6-? BR-'(7B*(e-^!)* (f (e^*)*Q(r)x«(o) de) dt 


Therefore, we have an integral equation 


o(s) = C(s)xo(T) + [^ DG, oso) do (9-43) 
where 
C(s) = eA? + f? e^t- ^ BR-()B*(e7^)* di (e^?)*P 
D(s, c) = e^ f e-^ BR-{1)B* (e7^)* dt (e^*)* Qa) 
where we have used the relationship 
[eX f e@de) at = f" (f roa) cae 
Let 
x«(5) = G(s)x«(T) (9-44) 
where G(s) is an n x n matrix. From Eq. (9-43) and Eq. (9-44), 
G(s)x (T) = C(s)x(T) + i D(s, e)G(o)x«( T) do 
Now consider the integral equation 
GG) = C(s) + f : D(s, )G(c) de (9-45) 


where G(T) — I and C(T) — I. There is a unique solution of Eq. (9-45). 
Let us put 


w(t) = L(1)x,(T) (9-46) 
where 
LE) = —R"()B*e7^?*| (eR + f^ ^9) as | 
Consider next the equation 
x(T) = eA? E +f à e-^ Buy) dt | 
= eit 4 f e-^ BL? dt x(T)| (9-47) 


Solving Eq. (9-47) we get 
x(T) = [x — eA? fea BL) dt] ‘este (9-48) 
0 


Equation (9-46) and Eq. (9-48) define the optimal control vector u(t). 

In the present approach, we need to solve the integral equation (9-45) 
to determine u,(t). The problem of solving this integral equation, however, 
can be converted to that of solving the matrix Riccati equation (9-34) or 
its equivalent matrix differential equations (9-35). We shall show this in 
what follows. Since Eq. (9-45) can be written as 


G(s) = e^6-^ + f e^e-* BR-(OB*(7^)* dt (e^?)* P 
+f i e^6-? BR- ()B*(e- ^)* | f i (e^7)*Q(o)G(e) do | dt 
we have 
G(s) = e*e-^ + f : e^6-0 BR-()B*K(1) dt 
where 
K(s) = (67^) | (e^) P + f" (e*)* x06 d | 


Differentiating G(s) and K(s) with respect to s, we obtain 


266) = AG(s) — BR-\s)B*K(s), |. G(T)—I 
and 
KO Z -Q()G(s)— A*K(), —K(T)—P 


Note that T — sis the remaining time in the process. Define 


Y(T—s)—G()  Y(0-—G(T)-I 


(9-49) 
Z(T — s) = K(s), Z(0) = K(T) = P 

then, ] 

dG(s) |  dYX(T — s) dK(s) _ . dZ(T — s) 

d  dqr-s ds d(T — s) 

and 

dX(T I— s) — -is -1 * ET uo. 

“UT = s) = — AY(T — s) + BR^(s)B*Z(T — s) 

dZ(T — s) = "ER * -— 

WT-s- Q(s)Y(T — s) + A*Z(T — s) 


By letting T — s = T3, we obtain 


Ea = —AY(T) + BR-(T — T)B*Z(T;) 

(9-50) 
dZ(T, 
vi = Q(T — T)Y(T) + A*Z(T)) 


which are the same as Eq. (9-35). We have thus shown that the problem of 
solving Eq. (9-45) can be converted to that of solving Eq. (9-35). 

The value of f(c, T), the minimum value of J(c, T), can be obtained 
[without assuming the quadratic or Hermitian form given by Eq. (9-32)] 
as follows: By using the relationships 


L(t) = — R-(1)B*K(1) 
and 


K*()BR-'()B* = G*(sA* — ZEO) 
we have | i 
f(e, T) = Je, T) 
= xtCDPX(C) + f xs) ds + f “ w3(s)R(s)u(s) ds 
- xr)? +f a G*(S)Q(S)G(s) ds + f : L*(s)RG)L(s) ds! x(T) 
= xir)? -Í À G(s) [£KG) + A*K(s)} ds 
+f 'K*(S)BR-(()B*K(s) 4| x(T) 


= xt(7)|P - f ! G*(s) aes) ds — f 


0 


ars) K(s) ds| X(T) 


= xt) -f f 4 (G*(s)K(s) ds |x) 
= x3(T)[G*(0)K(0)]}x.(T) 


Since 
x«(s) = G(s)x,(T) 
we obtain 
X(0) = G0)x.(T) and xji(0) = x#(T)G*(0) 
Hence, 


Jic T) Ae; T) = x$(0)K(0)G(0)x.(0) = c*K(0)G- '(0)c 


Thus, f(c, 7) becomes a quadratic form or a Hermitian form. In terms of 
Y(T) and Z(T) given by Eq. (9-49) and Eq. (9-50) we obtain 


f(c; T) — Je, fc ZY dne 
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Example 9-3 Given the control system shown in Fig. 9-10 and the 
performance index 


Jc, T) = x (T)x(T) + f x'(t)x(t) dt + &L u(tu(t) dt 


find the optimal control function u(t) and the minimized performance index 
f(c, T) 2 min J(c, T). 


Optimal 
Controller 


Figure 9-10 Control system. 


The. differential equation of the plant is 
X = Ax + Bu, x(0) =c.. 


where x, u, A, and B are scalars. Clearly, A = —1 and B = 1. From the preceding 
analysis it can be shown that the optimal control function u(t) and the minimized 
performance index f(c, T) are representable as 


ut) = — È B'S(T—2)x() and fle, T) = S(T) 
S(T) = [balT) + $2XC)0 [6:(7) + i O)17! 
and $1:(T), di T), 2i(T), and (T) are defined by 


eMT = Fes i m 


where 


$u(T) | p(T) 


where 


The eigenvalues of M are found from 


1 
= =P-—~-]-—-—= 
IM- | =é 1 X 0 
or 
& = Al UM L & — : 


Let us use Sylvester's interpolation formula to obtain e™7. 


| 
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1 & eT 
1 & e57|-0 
I M eMT 
Noting that É: = —£, and expanding this determinant around the last column 
T£ — L 1 -fi — 1 
m an (1 + Eet? er er) | 
1 g^ *T — ghT (1 - E elt —(1+ Even? 
Then, 
palT) a bx(T) - (eb? + e-hT) 
and 
—3 Al Yenr.(1- A) te 
$«7) + GAT) = zr [(1 + E X)e7- (1-8 + Zee] 
or 
= 2& 
T) + d(T)! = ————— ————— 
oul?) + oe" = EE ASIE - [i — & Oe 
Thus 


S(T) = [Gu(T) + p(T) I68(7) + p(T) 


» £i(e^iT T e-^T) 
fl + & + 0/Xge7-[ — £ + Nei 


Substituting £j, = V(A + 1)/X into the preceding equation 
1 + e- vV AFIAT 


SD = TEIA IN HU — WO D/AJe Rm 


Hence 

adi s. —(1 + e-?V à *07X(-0)x(t) 

NY NE AX. TD)-c-[^- VM + 1)je yV A+A T-t) 
and 


1 + e-v AFIAT 
f(c, T) = " 
] 4 (o 1/A + [1 — MA t+ 1)/A]Je7? V 6 D7AT 


Figure 9-11 shows a block diagram of the optimal control system designed. 


2 


e= VO4/x (T-t) 


i+ 
Xe [1- AG «n]e eee 


Figure 9-11 Block diagram for optimal control of plant defined by x = —x + u. 


9-5 MINIMUM ERROR CONTROLLERS WITH 
POLYNOMIAL INPUTS (9-5, 9-6, 9-8) 


The problems considered in Sections 9-2 to 9-4 are formulated as regulator 
problems; namely, the system has no input and is subjected to only the initial 
condition. In this section we shall consider systems with nonzero inputs 
which may be approximated by polynomials in f. In synthesizing a minimum 
error controller it is assumed that the polynomial input is known as a func- 
tion of time for a given time interval 0 < t < T. Note that if a system is 
designed so as to perform optimally with respect to some class of signals 
such as polynomials in ż of degree m, then the same system will perform 
nearly optimally in response to any other input signal which can be ap- 
proximated suitably by an mth-degree polynomial in t, over some interval 
of time. 
Consider the control system shown in Fig. 9-12. The state space equation 
of the plant is given by 
X = A,x + Bu, x(0)— c 
where X — n vector (state vector) 
tü = r vector (control vector) 
A, =n X n constant matrix 
B, =n X r constant matrix 


y D-FU- x0) 


a (D-2HO- 0€), 
Figure 9-12 Control system. 


In the block diagram of Fig. 9-12 the feedback vector y(t) is assumed to be 
a linear function of the state vector x(t). The relationship between x(r) and 
y(t) is given by 

y(t) = F(T — t) x(t) 
where F(T — r) is an r x n matrix whose elements are time-varying. Here 
t is the present time and T — t is the duration of time rernaining in the 
process. 


Let {E(t)} be a class of system inputs for which the system is designed to 
be optimal. We assume that every member of the class Ee} satisfies the 
mth-order differential equation. We define an element E(t) of this class as 

EQ)=&(t) fort>0 
E(t) =0 for t <0 


where &,(t) is the first element of the input state vector &(t) which is defined 
by the following state space equation: 


=A 
where 
E,(t) 0 1 0 0 
E(t) 0 D ff 0 
«07 ib A-[iiio i: 
Ex-a(£) 000 ss 0 1 
Enlt) 0 :0 OQ «= Q 9 


The first element &,(t) of the input state vector (z) is the polynomial ap- 
proximating the actual input signal. The matrix A; is an m X m matrix. 
It is noted that, as far as the state variables of the input are concerned, 
these are strictly fictitious quantities. They have been introduced in order 
to permit optimizing system performance for a particular class of input 
signals. In the block diagram the vector %(t) is defined by the following 
relationship: - 


9(t) = H(T — 1) &(0) 


where H(T — t) is anr x m matrix whose elements are time-varying. 

The problem considered in the following is to determine, for a given 
polynomial input, the optimal controller or the elements of the matrices 
F(T — t) and H(T — t) such that the given system performance index is 
minimized. Such a problem may be solved by defining a new state vector 
z(t) consisting of x(t) and &(t) such that 


x(t) ] 


ud | 


ad " EA (9-51) 


En(t)_| 
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Clearly, z(t) given by Eq. (9-51) is an n + m vector. By defining 


Sal cars 
err 
(9-52) 

B. 

- 
0 

the state space equation of the system can be written as 
z= Az 4+ Bu 


Let us consider the quadratic or Hermitian performance index defined by 
l T 
(e, T) = z*(T)PZ(T) + a f, u*(rDu(t)dt 


where P is an (m +n) x (m + n) positive definite (or positive semi- 
definite) real symmetric or Hermitian matrix characterizing the weight of 
errors specified for performance measurement, and X is a positive constant 
which indicates the weight of control cost with respect to the minimizing 
errors. [Note that if x, and x, are a complex conjugate pair, then pi; = Pi; 
where P = (p,;).] Generally, z*(T) Pz(T) may be given by 


z*(T)Pz(T) = X ET) — eer i" 


where the w; are weighting factors, £, — x, is the error, &, — x, is the error 
rate, and so on. For example, if both x and & are two-dimensional vectors and 
if it is desired to minimize [E(T) — x(T)P + [&(T) — X(T)’, then the 
matrix P is given by 


In fact, 
z*(T)Pz(T) 
1 0 -—af 0 x(T) 
6. i 9-—IT|[|x 
= pa(T) x(T) &(T) &C)] ux 0 1 0|| &(7) 
0—1 0 TD 


= [É(T) — xT)? + (E(T) — xD 


It is possible to assign different weights to the individual errors by appropri- 
ately selecting the elements of P. 
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The derivation of the optimal control vector for this problem is identical 
with that of the problem discussed in Sections 9-2 and 9-3, where no input 
signal was considered. Therefore, the derivation of the optimal control vector 
to the present problem will not be repeated. From Eq. (9-12) the equation 
of the optimal control vector u,(t) is, for T — t units of time remaining in 
the process and z(t) (instead of x(t) as before) the current state, 


u(t) = —B*(e^?-?)* Pat 


+f E e^ BBY(e™)*Pds| e^o z(t) (9-53) 
0 


where in this case the matrices A and B are given by Eq. (9-52). Equa- 
tion (9-53) gives the optimal control vector u,(t) for any time ¢ where 
0t T. 

To find the equations for the time-varying matrices F(T — t) and H(T—:) 
note that 


u(t) = 7(t) — y(t) = K(T — NE) — F(T — t)x(t) 
and in matrix form 
x(t) 
§(t) 


By denoting F(T — t) and H(T — t) corresponding to the optimal control 
vector u(t) as F,(7 — t) and H«(T — t), we obtain 


u(t) = [— F(T — 0): HT — 0]z(0) 


u(t) = [- F(T — 2: (T — 2l -[- KT — 0: K(T — plan 


(9-54) 


Equation (9-53) and Eq. (9-54) represent the same optimal control vector. 
Hence the matrix [—F,(7 — t)|H,(T — t)] can be equated to the coefficient 
of z(t) in Eq. (9-53). Therefore, F(T — t) and H,(T — t) are obtained from 
the relationship 


[— FT — D); H«(T — 1] = —B*(e^7-?)*P x 


T-t -1 
+f e^ BB*(e™)* Pas] eat (9-55) 
By equating the corresponding elements of the matrices on both sides of the 
equation, we obtain the expressions for the elements of matrices F,(T — t) 
and H,(T — t) for optimal performance. Using A and B given by Eq. 


(9-52), we obtain 
A. 2:0 
Va " va T 


As Wr 
M= | oo] aeg ats 
OLI L0:A, 


— ——————————— Oa Á patna 


and 
Ass! [f pAsty* 1 d 
e* BB*(e^)*P as: eae [Bz 0] | E 
0 terso 0 :(e^5)*] [Pos Poo 
7 ie B,Bi(e**)* i 4 ES x 
0 ‘0 Paii Poo 
= eee Py, d 
0 i 0 
Define 
MT — 1) =f" eee ELE 
0 0 i 0 
and 
Ni: x 
Se Sees = P [AMI + MT — A. 
li: Nao l ( n 


then Eq. (9-55) becomes as follows: 
[— F(T — Di B«(T — 1)] 


—[B: 0] [oe yr 0 Nui Nie pem 0 
"UL € WeREOSy (Na Na 


— [B*(e^:(7-0)* N e^: -9 J BE (atc -9)* Npe“ =) 


Q jeArt-0 


ll 


Hence we obtain 
F(T t t) = B*(e^-7-0)* N,e* 0-0 


H,(T ges t) — —; Bž(e^T-0)*N e^ (T-t) 


(9-56) 


Equations (9-56) define F,(T — t) and H,(T — t) for optimal performance. 

In order to mechanize the optimal control system, it is necessary to pro- 
gram the feedback time-varying matrix F,(T — t) and the input time-varying 
matrix H,(T — t) as functions of time remaining in the process, T — t. In 
other words, the time remaining, T — /, is a quantity that must be available 
throughout the control process in order to realize the optimal control law 
specified by the theory. 

In presenting the method, we intentionally used a simple quadratic or 
Hermitian performance index. As a matter of course the method applies to 
more complicated quadratic or Hermitian performance indexes as well. 

Note that the system is not designed for any particular input, but is 
designed for a polynomial in the general form. The elements of F,(T — t) 
and H,(T — t) can be adjusted according to the degree of the polynomial 
input. 


As an example of the design of an optimal control system by the present 
method, consider a simple case where the plant is governed by the following 


xi 0 1 x; 0 [ ] 


where X, Xs, and u are real quantities. We assume that the input to the system 
consists of a family of ramp inputs. Then, the input satisfies the following 


d i jJ | A 
: = 


If the performance index is given by 


Ke, T) = 2'(T)Px(T) +r | “(Ou tat 


where 
x«t) 1 0-1 0] 
x(t) 0 1 0 —1 
Ut):= ; P= 
E(t) —] 0 1 0 
E,(t) BU 2d 0 1| 


then the elements of F((T — t) and H4(T — t) of the optimal control system 


are determined as follows (for the derivation of these matrices refer to 
Prob. A-9-7): i 


= 4 i 
F(T — 1) = A: [XT — 0) + KT — 0D ix -X(T — t iT try] 
= [A(T — t)i fT — 1) 
d | 
H,(T — t) = à: [(T—0-c-XYT—0ixc-XxXT—rD- i(T — t)’] 
= [A(T — t)i AT — $) 
where 
A, SNENMT- 9T- + dry 
In this simple case, F,(T — 1) and H,(T — t) become the same. Figure 9-13 
(A) shows a block diagram of the optimal control system. Figure 9-13(B)isa 
simplified form of the block diagram of Fig. 9-13 (A). If the input to the 
system consists of a family of ramp inputs and if the time derivative of the 


input is constant for each time interval 0 « 1! < T, then the system designed 
here is optimal for the given performance index. 
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Figure 9-13 (A) Block diagram for optimal control of plant defined by x, = u; (B) sim- 
plified block diagram. 


9-6 CONCLUDING COMMENTS 


The use of a quadratic or Hermitian performance index in conjunction with 
a linear plant will result in a linear time-varying control system. The feedback 
and feedforward gains are not functions of the state of the plant but 
merely functions of the remaining time in the process. Note that functional 
forms of weighting factors appearing in performance indexes should be 
chosen from the consideration of the time domain performance and cost 
requirements. 

As shown in this chapter, quadratic or Hermitian performance indexes 
are mathematically tractable. Synthesis procedures are straightforward. The 
resulting optimal controllers, however, are not simple to build in the sense 
that they are time-varying. 

It is important to remember that before we decide whether to build the 
optimal control system or something inferior but simpler, we should careful- 
ly evaluate a measure of the degree to which the performance of the complex 
but optimal control system exceeds that of a simpler but nonoptimal control 
system. Unless the complex but optimal control system can be justified, we 


MB mat Leld ave€eA-—-l namnlinntad antimal eantral cucteme — Suctemc 
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synthesized by optimal control theory should then be interpreted as a basis 
for defining the ultimate performance limitation under the given performance 
index. Once we know such ultimate performance limitation we can make 
efforts to design a system that is simpler yet close to optimal. 


EXERCISE PROBLEMS 
AND SOLUTIONS 


Problem A-9-1 (9-8) Suppose that a linear system is described by 
X = A(f)x + B(t)u 
where X — n vector (state vector) 
u — r vector (control vector) 

A(t) =n x n matrix 

B(t) =n x r matrix 
The elements of A(t) and B(t) are assumed to be continuous functions of time. 
Assume that the system is completely state controllable. It is desired to bring the 
state of the system from c at / = f, to xy in time T. Obtain the optimal control 
vector which brings the state from c to x, in time T with the minimum of the 
quantity 


To 
l , o w*()u(o) dr 


Solution Any integrable control vector u(t) transfers the initial state 
c to the final state x; if the following equation is satisfied: 


X(T + to) =x, = DT + to, to) € + ‘eng OT + to, 7)B)u(r)dr (9-57) 
where dt, fo) is the solution of the following matrix differential equation: 
Blt, to) = ADU, to), — (t, to) = 1 


There is one control vector u(t) such that it also yields the minimum value for 
the quantity 
Tilo 
f , o u*(Du()dt 
Denote this particular u(t) by uo(¢). We can obtain u,(t) by applying calculus of 


variations using Eq. (9-57) as the equality constraint. Namely, first determine u(r) 
such that 


| (u*(t)u(t) + A*®(T + to, t)B(t)u(t)] dt 


where A is a Lagrange multiplier (an n vector), is minimum and then determine A 
by use of Eq. (9-57). The result is 
u(t) = B*()B*(T + to, AVT + to, to) [X(T + to) — B(T + to, to)c] 
where 
VCT + tos to) = (I BT + to, DBOBN OO T + to,t)dt (9-58) 


Problem A-9-2 (9-8) Consider the system defined in Prob. A-9-1. 
It is desired to bring the state of the system from c to x; in time T. As time T ap- 
proaches zero the quantity 


T+to " 
[leer ae 
approaches infinity. Therefore, there is a least time 7; such that 
f a luc)? dt = K (9-59) 


where K is a given positive real number. 
Show that 7 is the smallest value of T for which the following matrix is posi- 
tive semidefinite: 


KV(T + to, to) 
— [x(T + to) — B(T + to, to)c] [X(T + t) — B(T + to, to)c]* 
where V(T + to, fp) and @(f, to) are defined in Prob. A-9-1. 
Solution The solution to the system differential-equation is given by 


x(t) = Blt, tre + dt, to) ja D-(r, tc.) Br) u(r) dr 


= ex, ne + | d dr, 7) B(z)u(7)dr 
From this equation we obtain 
x(T + to) — OT + to, toe = | vi OT + to, 7)B(r)u(7)dr 
Hence, if A is an arbitrary n vector (A z& 0) 
[AFET + to) — XT + to, tell = | f NE MOT + to, 7)B(r)u(r)dz|| (9-60) 
Using the following Schwarz inequality, 
[irena] «(tree [nner] 


where u(7) and v(7) are r vectors, Eq. (9-60) becomes 
|| AEXCT. + to.) — B(T + to, to)e]ll 
T 
t 


< UL |A*@(T + to, T)B(T)| dr f us ||u(7)]P e] 


Squaring both sides of this last inequality and making use of Eq. (9-58) and Eq. 
(9-59) we obtain 
A* ([x(T d to) = DT + fo, to)c] 
[X(T + to) — B(T + to, to)c]*} À < KA*V(T + to, 1)À 


If the specific state x(T + 7j) = x; is controllable relative to Eq. (9-59), this last 
inequality should be satisfied for any A where A = 0. Therefore, 


C(T + to, to) = KV(T + tos to) — [x(T + to) — O(T + to, toe] 
- [X(T + to) — B(T + to, to)c]* 


| 
| 


Should be positive semidefinite. The minimum time which the state x, can be 
reached is then To, the smallest T for which C(T + fo, to) is positive semidefinite. 

Problem A-9-3 (9-7,9-10) Consider a tumbling satellite in orbit. 
The problem is to reduce the angular velocity of the rigid body rotating about its 
center of mass. The control process consists of applying a time-varying torque 
vector to the rotating body in such a manner that the following performance index 
is minimized: 


T 
J(e, T) = x'(T)x(T) + X [ , w(Dw(at 
where x(t) is the angular momentum vector of the rigid body and u(r) is the applied 
torque vector. Find the optimal control vector u(r). 


Solution Euler’s equations for a rigid body rotating about its center 
of gravity are obtained from the following differential equation: 


H+@xH=T, H(0)=c (9-61) 


where H = angular momentum vector 
@ = angular velocity vector 
T = external torque vector 


Denoting the angular momentum vector by x and the applied torque vector by u, 
Eq. (9-61) can be put into the standard-form 


x-Ü(x)x-u x(0)=c 


Denoting the moments of inertia about the principal axes of inertia as 4, B, and C 
respectively, this last equation becomes 


E (B e C)o0:05 ui 
x= Bor = (C — A): + | us 


Có; i (4 — B)o10» us 
0 O3; — a: | [401] ui 
—-|—o; 0 @1 | | Bo» | + | us 
€» — (9 0 Co; | us 
or 
Xi 0 oe me I [ x, uy 
A 1 
Xx.) =| 7 Th 0 a Xa |+| uo 
1 1 
Xs B® - aq 0 ls Us 


Here xı, x», and x; are state variables, and 1, wu», and u, are components of the 
applied torque about the principal axes of inertia. Note that Q(x) is a skew sym- 
metric matrix. The minimized performance index is 


fie T) = min] x/(7)x(T) x, i d w'(w(ar | (9-62) 
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The partial differential equation which should be satisfied by /(c, T) is given by 


i£ = min| Aw'(0)u(0) + (V 4 (9-63) 
where vn =E % aa 
and 
de = Oeje + u 
The optimal torque vector u;(0) is obtained as 
w(0) = — VS (9-64) 
Substituting Eq. (9-64) into Eq. (9-63), 
2f = (vy (Qc 3, V/) (9-65) 


From Eq. (9-62) we know that f(c, 0) = c/c. Therefore, we assume a solution of 
the form 


fle, T) = e'S(T)c (9-66) 


where S(T) is symmetric and S(0) — I. Substituting Eq. (9-66) into Eq. (9-65) 
and rearranging, 


[E sa- gs + iss] = 
e| 8 — sQ- O's + SS= o 
The solution to the following matrix Riccati equation 


dS — Lame 
& - SQ - O'S + +ss=0 


can be found by the standard technique. In this problem, however, because of the 
skew symmetric nature of the matrix Q we may assume S(T) in the following from: 


100 
S(T)-2s(T)|0 1 0j, 
00 1 


(9-67) 


5(0) = 1 


Then, 
sQ + Q's-0 
and Eq. (9-67) is simplified to give 


HRS Sae 
+ 388-0 


s(T) is then determined from 


ds(T) 
aT 


doar = 
+ xs) 0 


cer care ed cs oo a a Sn nic einen Mc À — | 
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which can be solved easily to give 


d ae 
(D= RT 
Hence, 
E A zu A / 
fe T) = e( tale = mee 


The optimal control vector is 


1 


w(0) = ->F T 


c 


Letting T — t be the duration of time remaining in the process and x(/) be the 
current state, the optimal control vector uj(t) becomes as follows: 


bo 1 
u(t) = FTD x(t) 


Problem A-9-4 Consider the control system shown in Fig. 9-14. 
The performance index is given by 


J(e, T) = x (T)X(T) + f u (t)u(t) dt 


Xi X 
x= |l aeo 
X2 s4 


where 


Optimal 
Controller 


Figure 9-14 Control system. 


and X is a positive constant. The optimal control function uo which minimizes this 
performance index can be obtained in the following form (See Example 9-1.): 


{MT — 1) + XT — 0)x(0 + Qv XT — t + UT — 09) xlt) 


ult) = — 
V+ MT — t) + AA(T — £y + 4 (T — 4) 


Assuming X = 1, T = 1 and the initial condition x,(0) = 2, x.(0) = 2, draw a 
state plane trajectory for the time interval 0 < 1 < T. 


Solution Divide T into N equal time increments At, or N At = T, 
and then compute uo(n At), where, for A = 1, 


E [(T — nAr) + (T — n At] x(n At) 
1+ (T — nAt) + (T — nAd + 4T — nAr)' 
HI (T - nAPD + KT — nA] xi(nAr) 
1+ (T — nAt) + HT — nât)’ + 44(T — nny 
If we divide, for example, T into five equal time increments, then Ar = 0.2, and 
uc(0) = —0.62x,(0) — 0.96x2(0) 
uo(0.2) = —0.56x,(0.2) — 0.90.x2(0.2) 
uo(0.4) = —0.46x,(0.4) — 0.85.x2(0.4) 
uo(0.6) = —0.34x,(0.6) — 0.83x«(0.6) 
uo(0.8) = —0.18x,(0.8) — 0.87x2(0.8) 
uo(1.0) = —x2(1.0) 
The state plane trajectory can be plotted by the use of the equation 


dx» u . 

—— = — = slope of trajector 

dx; Xs p J y 

The value of u for each time interval, 0.2 sec, is assumed to be constant. (The 
values of u are taken at the midpoint of each interval.) Figure 9-15 shows the state 


plane trajectory. 


uo(n At) = 


f(2,2;1)=12.1 
2 7-0 


f(x, (1), x50); 7-7) 
| decreases along 


the trajectory 


£(2.87, 0.26 ;0) 
Pei -8.3 


Figure 9-15 State plane trajectory. 


Referring to Example 9-1, f(c, T) for this problem can be given by 


MO + T)ei + 2DXT. + (7?/2)]eies + [X + AT? + (72/3)]62) 
M+ XAT + A(T?/3) + (T4/12) 


Substitution of X = 1 and T = 1 into f (c, T) yields 
f(e;, C23 1) = 0.83c} + 1.24cıc2 + 0.96c3 


f(c, T) = 


At the starting point, c; = 2 and c; = 2. Hence 
$(2,2:1) — 121 
At the end of the process, 
f G1), x«(1); 0) = xi(1) + x3(1), 


x,(1) = 2.87, X(1) = 0.26 


Hence 
J (2.87, 0.26; 0) = 8.3 
By a graphical computation 
1 
fiato at 
becomes equal to 3.8. Hence, we can verify that 


/(2,2;1) = f (2.87, 0.26; 0) + Í à ub(t)uo(t) dt 


Problem A-9-5 Given the system configuration shown in Fig. 9-14 
and the performance index 


J(e, T) = x"(T)x(T) + I à x'(Dx(f) dt + fà a u'(t)u(t) dt 


; x x1 
I ere 
X2 x, 
obtain the optimal control function u(t) and the minimized performance index 
f(c, T) = min J(c, T). Plot a few curves of constant f(c, T) for T = 1. 


where 


Solution The state space equation of the system is given by 
X = Ax + Bu 


- à ort 


From Eq. (9-38) and Eq. (9-39) the solutions for the optimal control function and 
the minimized performance index for this problem are given (with X = 1)respectively 


ult) = —B'[D(T — t) + «(T — NDT — t) + (T — 0]7x(0 
Slc, T) = e'[b( 7D) + AT) [,(7) + DT) e 
where the ®;; = ®:,(T) (i, j = 1, 2) are defined by the equations 


where 
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eT = aaa M= Ba 
Pa! Dz 
For this problem 


0 —1 10 0 
0.010 1 
M =| ------ oe 
1 010 0 
I 
0 3d!1 3 
For the following 4 x 4 matrix G: 
oO ONT 9 
! 
1 ‘OO 1 
G =| -----7 EL. 
Q0! 1070 —1 
0 1'0 0 
eST was given in Prob. A-6-12 as follows: 
pb — qa -—pa qb 2qb —2qa 
- pa- qb pb+qa  2ga  .. 24b 
ecT = 
— pa + qb —2qa pb —qa  —pa - qb 
2ga pa-- qb pa — qb pb 4 qa 


where p= + (e(V3/2T + e- (3/27) 


1 
q= (ev 3/2 — e- (3/2) 
24/3 
a — sin T 
- T 
b — cos 3 


Notice that since 


0 ; I 0I 
- pieh 
1; 0 NU 


where I is the 2 x 2 identity matrix, we have 


"m B n ES E +l 2 Er 
I:0 1:0 Dy Di; 


From the definitions of ®,, and ®,. and from Eq. (9-68) 
b — qa + 29b — pa + qb — 24a 
m + Boo = l^ * 
pa + qb + 24a pb + qa + 2qb 
Similarly 
—pa + qb + pb— qa  —2qa — pa — qb | 


+ r = 
gu " | 2qa + pa — qb pa + qb + pb + qa 


(9-68) 
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Inverting 
1 [pa + gb + pb + qa 


2qa + pa + qb 
[®,, + D] = T 


—2qa — pa + qb —pa + qb + pb — qa 
where A = 3q'a? + 2pq + p*b* 
The optimal control function is obtained from 
u(t) = —[O 1] [PCT — t) + (T — NiD (T —1) + (T — 0]? x() 
By defining 
p= i (e(V3/2 (1-0 + e- (372) (T-1)) 


ENS. (3/2) (T-t) — e- (3/2) (7-1) 
li =S (e e ) 
a, = sin Le f 
b, = cos re f 


we obtain 


pibi — qiai  2qi1bi 


wi = qo i] | | —— ie 


pibi + gia: + 2qibi 
2qıaı + pia, + gibi | e 
x(t) 


Pia + qıbı  2qiai 
piat qibi + prbi + gia 
. (9-69) 
—2qiai — pian + qiby —piqi + qibi + pibi— qiai 


The minimized performance index is given by 
f(c, T) = e'[D4 (7) + Bao (T)Bi(T) + (T) c 


= te, iG (aie 2qb — —pa + qb — 2qa 
A pa + qb + 24a pb + qa + 24b 
pa 4- qb + pb + qa 2qa + pa + qb cı 
. (9-70) 
—2qa — pa-- qb — pa + qb + pb — qajl c: 
For T — 1 we have 
p i (ev3/? + e-V3/2) = 1.40 
g= z e — e-Vii?) = 0.567 
(9-71) 
a = sin} = 0.480 


b — cos i — 0.878 


Then Eq. (9-69) becomes 


iilii « zi um rs | 2.07 1.71 de | 
s 1.71 2.50 j| —0.716 0.786} | x«(0) 


= — 0.835 x,(0) — 1.47 x2(0) 


This equation gives the initial value of the optimal control function. The optimal 
control function u,(t) for 0 < t < T is obtained from Eq. (9-69) which is a func- 
tion of T — r. Substituting Eq. (9-71) into Eq. (9-70), we obtain 

1.985 —0.716 2:07 — 171 €1 
1.71 2.50 —0.716 0.786 


" = 1.72ci + 1.67c1c» + 1.47¢3 (9-72) 


f= sigla eal] 
| 


For plotting the constant f (c, T) ellipses, the major and minor axes of the ellipses 
are found. The major and minor axes (é; and ĉ axes) of the constant f (c, T) ellipses 
are related to the c, and c» axes by the relationship 


Fee bel M 


Inverting 
€i cosé —sin£][ à 
NEW cos É a 
Substituting 
cı = ĉ cos É — ĉ sin E 
€» = ĉ sin E + éscos£ 


into Eq. (9-72), 
f(c, 1) = 1.72(ĉ, cos E — ĉ sin £} 

+ 1.67(6i cos £ — ĉ sin É)(éisin E + ĉ cos £) 

+ 1.47(4 sin E + ĉ cos £y? 
Making the coefficient of &,£; equal to zero 

—0.50 cos £ sin É + 1.67 (cos? £ — sin? £) = 0 
Thus 
E = 40.8° or —49,2° 

A few constant f(c, 1) ellipses are plotted in Fig. 9-16. 


Problem A-9-6 (9-6, 9-9) In general, it is not possible to design a con- 
trol system for optimal performance in response to all kinds of system input 
signal r(t). In order to avoid this difficulty, it is common to optimize the control 
system only with respect to a restricted class of input signals. 

Let {r(t)} be a class of system inputs. An element r(t) of this class is defined as 


r()—y() t>0 
=0 t<0 


where y(t) is the first element of the input state vector y(t), where y(t) is the solu- 
tion of 
y = Ay 
Namely, 
y() = ens) 


f (oN) =1.72c? +1.67c C, + 1.4765 


f(c,I)*9 


Figure 9-16 Constant f(c, 1) ellipses. 


€b(r) is the transition matrix of the input state space equation and y(0) is an arbi- 

trary constant vector. Namely, @(t) determines the class of input signals for which 

the system is to be optimized and y(0) determines a particular member of that class. 
Obtain @(t) for the following classes of inputs: 


(1) The class of all step functions. 
(2) The class of all step and ramp functions. 
(3) The class of all sinusoidal functions having the frequency o. 


Solution 


(1) The class of all step functions {r(t)} can be described by 
At)=0, 130 
or putting r = y we have 
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Hence, A = 0 and we obtain 
P(t) —e4 = 1 
(2) The class of all step and ramp functions {r(s)} can be described by 
F(t) = 90, t0 


[3-7 


then the state space equation describing the input can be written as 
b sid 
ye 0 Oly: 


an = di I. | 


Select the input state vector as 


The transition matrix is 


Thus, the input is described by : 
KO} pn npo 

ba 7 i | pn 

(3) The class of all sinusoidal functions having the frequency œ must satisfy 


the differential equation 


F+o°r=0 
Select the input state vector as 


J- 


then the state space equation describing the input can be written as 


eaa 


The transition matrix is 


o 1 cos wt i sin or 
eo = d» o 
—@sinwt cost 
Hence 
r(t) cos œt 4 sin ot | | r(0) 
F(t) —@ sin of cos @t 7(0) 


Problem A-9-7 Consider the control system shown in Fig. 9-17. 


The equation of the plant is given by 
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Figure 9-17 Control system. 


A] 0 dix 0 
H " b "UM +| jo 
The input &,(t) is assumed to be a ramp input. Thus it may be written as 
| & : [9 E & & 
a g l lle} ue Hn 
The performance index is given by 


J(e, T) = z (T)PzT) 4- É u (ur) dt 


where 
xi(t) 1-9 —1 D 
x(t) 0 1 0 —i1 
c = x(0), z(t) — , = 
) éA) - © £ O 
£x) č- 00 1 


Determine the elements of matrices F((T — t) and H«(T — t) so that the system 
behavior minimizes the performance index. 


Solution From the analysis made in Section 9-5, F;,(T — t) and 
Ho(T — t) can be given by the expression 


[-F«(T —£) Ho(T — t] = —B(e^C-tyP 
Dac f° e^ BB (eMYP ds]-te^c-o 


where - 


and Az, A;, and B, are given by 
X = A,X + Blu 
E = Af 


Since for this problem 


we find 


Hence 
2 


» Ss S 
e^5B.B; (easy =e | | 
$ d 


ni 
| M&iT-0- a e^*BB'(e^*P ds 


$ s —s —g 
r-ti| $ 1 —s —1 
=Í ds 
0 0 0 0 
0 0 0 0 
(T-® (T-¢® _(T-¢® _(T-t 
3 2 3 2 
[^E ws 4 LAE gra 
= E Jh 7 (T — 1) 
0 
0 


and 
[AI + M;(T — 1)] 


"oum AE OE S AN ER. 
3 2 


2 3 
Lg T — 1) 
-| ZE xeg-o -G5® -r40 (9-73) 
0 0 X 0 
0 0 0 A 


In Prob. A-2-7 the matrix given by Eq. (9-73) was inverted. Its inverted form is 
[AI + M-(T — HJ 

2 LM pe (Tap T0 

E atr-) -Xg-o s(a) 


hie. as aati.» a S m X mere 
—X(r-9 v C) Mr-D Eunc-o«ca-on 


1 — a 
x á A 132-2 -1) " 


TAA(T — 0 - (T— t] 


iue 12A(T— f) 


A? 2 
(7-1) | 


TÁAAC(T — 0 - (T—20*] : 


where 
A =N ATOH MT - pag 


Define 
Nui Ni 
PIAL + M - D)H = Rd 
Na I hm 


Then we obtain 


MA+T-) -LX(r- 


-— 
n= N i T " 
—— S os 2 2 = 
ZAT- x(x Fo) 
and 
=~VO+ Fas SUT — 1) 
Nie = x i ( y 
` A2 -w-— NC T—t 
3T - 120 x(a T) 
Hence 


Fo(T — t) = BA(e^-9yN, eå:(T-0) 
«E ID E TD -LAT = e} i Fed 


EN | 
-4D 1) 
T-t i| -ixm-m nàr T) e i 
-A[Xr-04lq- [N+ MT — D +r- i] 
where Amg MEME jeg iT- +AT- 


X: 
and 
HT —fD)- —Bi(e^-(T-DyYN;5 e^t (7-0 


-A[xr- l(r-.p: Lp eter —p 
x MT =t) + zT t) IA +AT Dy + 3 Cr 2 
In this problem F,(T — t) and H,(T — t) are identical. The use of the time-varying 
matrices F(T — t)and H«(T — t) just obtained will result in the optimal perform- 
ance under the given performance index. 


Problem A-9-8 Consider the system given by the following state 
space equation: 
X = Ax + Bu, x(0)—c 
where X — n vector (state vector, actual state of the system) 
u — r vector (control vector) 
A =n X n constant matrix 
B =n X r constant matrix 
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The desired state of the system is a function of time and is given by xa(t) where 
xia (t) 


Xa (t) 


x(0-| . |, xe0) =e 


Xnd (£) 
We assume that the desired state xa(t) of the system is known. We further assume 
that x, Xa, u, A, and B are real. The system error is defined by the difference be- 
tween the desired state and the instantaneous actual state during the control 
process. Thus, the system error vector is given by x(t) — x(t). The performance 
index is given by 


He, T) = f tx«(0 — xOY QUO [xt — xde + [^ wtORGw(Oa: 


where Q(t) = positive definite (or positive semidefinite) 
real symmetric matrix 
R(t) = positive definite real symmetric matrix 


The matrix Q(t) determines how the instantaneous errors are to be weighted during 
the control process. The matrix R(t) represents the weight of control cost with 
respect to the minimizing errors. 

Define f (c, T) as the minimized performance index, or 


fle T) = min J(c, T) 
5 T ? 
= min{ f [xe(t) — x(OY QO [xe() — x(0) dt + f à v (RGw(0)dt) 
Show that f(c, T) can be determined by expanding f(c, T) in the following form: 
fle, T) = KT) - 2 È (Tei + X X kes(Tees 


where ku (T) = ka(T) .. 


and determining coefficients k(T), ki(T), and kij(T). Discuss methods of deter- 
mining these coefficients. 


‘Solution We shall first obtain the partial differential equation which 
should be satisfied by f(c, T). It is of the same form as Eq. (9-29) and can be obtained 
as follows: 


Of = min (Ix«(0) — xO’ Q(O)Ix« (0) — x(0)] 


+ w’(0)R(0)u(0) + (VS) LAx(0) + Bu(0)]] (9-74) 
The value of u(0) that minimizes the right-hand side of Eq. (9-74) can be obtained 
by solving 
0 = 2R(0)u(0) + BY VS 


Hence, the optimal control vector is obtained as 


dis — ro B'Vf (9-75) 
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where 


E] Ci 
Vf =gradf=| : 
of 
Cn 
Substituting Eq. (9-75) into Eq. (9-74), 


Of = [ea — c Q()[es — c] + (VAY Ac - L(V/)BR-()B'V/ (9-76) 


This is the partial differential equation which f(c, T) must satisfy. Let us assume 
the following quadratic form of the solution: 


f(x), T — 1) = [x(?) — ke(T — NY S(T — O[x() — ke(T — À) 
or 
f(e, T) = [c — k-(T)VS(T)le — k-(T)] 
By expanding f (c T), we obtain 


fle, T) = KT) - 2X k(T)ai + EY ku(Tyee; (9-77) 
where Ki(T) = ky (T) 


This expression is called a parametric expansion (9-11) because the coefficients k,(T) 
and k;,;(7) turn out to be parameters of the optimal control system. (Refer to 
Prob. A-9-9.) By differentiating Eq. (9-77) to obtain the derivatives of f(c. T) ap- 
pearing in Eq. (9-76) and then by substituting these derivatives into Eq. (9-76), 
the terms are collected such that the resulting expression is of the following form: 


Po + Pici + Paca +++- Pas + Puci + Piscico +++ PinCicn 
+ Pach + Poscacs +++ + PonCoCn e Panch = 0 (9-78) 


The P coefficients consist of terms containing the k parameters and their first deriv- 
atives, the desired state variables, the elements of A and B which describe the 
plant, and the elements of Q and R, which are weighting factors. If Eq. (9-78) is 
to be valid for all possible values of the initial state variables, each P coefficient 
independently must be equal to zero. Hence, 


Po = P = P;=0 J = 1, Zra (9-79) 
Equation (9-79) yields a set of 1 + n + n(n + 1)/2 independent ordinary fitst- 
order differential equations which define the k parameters. These may be called the 
k equations. As a matter of course, quantities appearing in the k equations, such 
as the desired state variables, matrices A, B, Q, and R, must be specified before the 
solution can be carried out. The boundary conditions on these k parameters are 


k(0) = k:(0) = k; (0) = 0 (iJ = L2- n) 


Except for simple cases, computation of A(T — t), Ki(T — t), kiX(T — t) for 
0<t < T will require a digital computer. Once the k parameters are determined, 
then the optimal control vector given by Eq. (9-75) can be written in terms of the 
actual state vector and the k parameters, using Eq. (9-77). 


Problem A-9-9 (9-11) Consider the landing problem of an aircraft 
system defined by 


X — Ax 4- Bu, x(0) —c 


where 
Xi an di a3 0 bi 
X2 1 0 0 0 0 
x= " A = » B = 
Xs 0 Q32  Q33 0 0 
X4 0 0 1 0 0 
and xı = aircraft pitch rate 


X» = aircraft pitch angle 
xs = time derivative of aircraft altitude 
x, = aircraft altitude 
— aircraft elevator deflection 
and A11, 412, 413, A32, 033, and b, are constants. The performance index is given by 
J(c, T) = G [xa (t) — x()l Q()[xa lt) — x(D]dt + L u'(t)u(t)dt 
where 
xia (t) 
Xa(t) = bass = desired state vector (given for 0 < 1 < T) 
Xsa (t) 
Xaa (t) 
xı (£) 
x(t) = m = actual state vector 
xs (t) 
x4 (1t) 
q(t) 0 0 0 
0 q(t) 0 0 
0 0 qt) 0 
0 0 0 qx(t) 


The elements q;(t), go(t), qs(1), and q4(t) in Q(t) are time-varying weighting factors 
which indicate the relative importance of the various terms in the performance index. 
(In carrying out the design of the optimal control system, these weighting factors 
must be selected such that the performance requirements and constraints are 
satisfied.) 

Show that the optimal control law can be written as follows: 


uo(t) = bila(T — t) = ku(T e t)x (t) dh kis(T = t)x2(t) 
— ky3(T — t)xs(t) — ku(T — toa (t0) 


Qu) = 


Solution Define 
f(e, T) = min J(c, T) 


= min[[^ Ix«() — x) Q(OIx«() — x(0)ldt 
+ L u' (t)u(t) dj) 


The partial differential equation which should be satisfied by f(c, T) can be obtained 
as follows: [The x; appearing in partial derivatives are x;(0).] 


95 = min[ai(Obss(0) — xi (OT + as (sio) — x: (T 
+ qs(0)[xsa (0) — x3(0)] + g«(0) [x44 (0) — x4 (0)? + u(0) 
+ [225 (0) + arexe(0) + ass (0) + bu £ 


ze x2, + [232 x2 (0) + asi (0)] ££ + x;3(0) 2L (9-80) 


The expression for the optimal control signal uo(0) is obtained by taking the partial 
derivative of the quantity within the braces with respect to (0). This expression is 


bi 
uo(0) = — S gf (9-81) 
Substituting Eq. (9-81) into Eq. (9-80) we obtain 


SE = a Oba (0) — xi (OP + ge (O)Lrss (0) — x2(0)F 


2 


+ qs(O)Lxsa (0) — xs (0). + qs (0)[x.« (0) — x4 (0)? + E) 


"s [a2 (0) + @12x2(0) + ai3x3(0) — a sf) af 


4 x0) $f + [asex2(0) + 235x3(0)] ge + x;(0) M (9-82) 


If Eq. (9-82) can be solved for f(c, T), then the optimal control signal 1(0) can be 
found using Eq. (9-81). 
In order to solve this problem let us assume f(c, T) in the following form: 


f, T) = KT) - 2 Y (T) (0) 


+ E, E Ko CT» (0x0) (9-83) 
where kis(T) = ka (T) 


The expression for the optimal control signal given by Eq. (9-81) can now be ex- 
pressed in terms of the state variables and the k parameters using Eq. (9-83). 
Taking the partial derivative of f(c, T) with respect to x, and substituting into 
Eq. (9-81) gives 
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Uo (0) = bili (T) = Ku(T)xi(0) = Ky2(T)x2(0) 
— ki3(T)x3(0) — Ku (T7)x4(0)] 


If we denote ¢ as the present time, T — t as the duration of time remaining and 
x(t) as the current state, then the optimal control signal is given by 


u(t) = bla (T — 2) — ku(T — x(t) — ku(T — t)x(t) 
— kis(T — x(t) — kulT — x (d) (9-84) 


This control law defines the configuration of the optimal control system. In carrying 
out the mechanization of the optimal control system the values of the time-varying 
gains must be obtained. These values are found by differentiating Eq. (9-83) to 
obtain the derivatives of f(c, T) appearing in Eq. (9-82); then, these derivatives 
are substituted into Eq. (9-82) and the terms are collected such that the resulting 
expression is of the following form 


Po + Pix (0) + Paxs(0) + Psx3(0) + Poxi(0) + Puixt(0) 

+ Piaxi(0)xs(0) + Pisx1(O)x3(0) + Pia xı (0)xa (0) + Poe x3 (0) 

+ Posx2(0)x3(0) + Pz: xs(0)x4(0) + Ps3x3(0) + Ps4 x3 (0)x4 (0) 

+ Pixi(0) = 0 PE (9-85) 


The P coefficients consist of terms containing the A parameters, the desired state 
variables, the elements of A and B which describe the aircraft, and the weighting 
factors q,(t). 

If Eq. (9-85) is to be valid for all possible values of the state variables, each 
P coefficient must independently be equal to zero. Hence, 


Pup Pij = 0 (9-86) 
(i, j = 1, 2, 3, 4) 


Equation (9-86) yields the fifteen independent ordinary first-order differential 
equations which define the k parameters. The boundary~conditions on these k 
parameters are 


K(0) = k:(0) = ki;(0) = 0 
(,j = 1, 2, 3, 4) 


Notice that the analytical design problem is thus reduced to the solution of the 
k equations. In general, the solution of the k equations requires the use of an analog 
or digital computer. Notice also that the solution of the k equations cannot be 
carried out until the desired state variables, aircraft parameters, and weighting 
factors are specified. If the desired state variables and the aircraft parameters have 
been specified, then, the design procedure reduces to merely the selection of a set 
of weighting factors g;(1) (i = 1, 2, 3, 4) which will result in satisfactory system 
performance under the given circumstances. 

The analytical design technique discussed in this problem has important 
advantages. In particular, multiple design considerations associated with complex 
dynamic situations can be included in the system design. The functional forms of 
the weighting factors appearing in the performance index can be chosen directly 
from the consideration of the time domain nerformance reauirements. Except for 
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certain mathematical restrictions, the functional forms of these weighting factors 
can be chosen without consideration of the System stability. One disadvantage of 
using the present technique, however, is that, generally speaking, a unique set of 
weighting factors does not exist when a set of design considerations is given. In fact, 


the simpler the design problem from dynamic considerations, the larger the number 
of sets of acceptable weighting factors. 


PROBLEMS 
Problem B-9-1 A control system is defined by 
x = Ax + Bu 
where X = n vector (state vector) 


u — r vector (control vector) 
A =n X n matrix 
B =n x r matrix 


Show that the control vector u will transfer the system state from x(0) at t = 0 to 
x(T) at t = Tif and only if 


[L e-^'Bu(t) dt = e-^7x(T) — x(0) 


Problem B-9-2 Consider the control System shown in Fig. 9-18. 
Define x, — x, x, — x. Suppose that the performance index is given by 


J(e, T) = x'(T)«(T) + { E u (rur) dt 


where A. is a positive constant. Obtain x and f(c, T) = min J(c, T) where c = x(0). 
u 


With A - 1 and T = 1 find the state plane trajectory starting from (2, —2) for 
the time interval 0 < 7 « 1. 


r=0 


for #20 Optimal 
Controller 


Figure 9-18 Control system. 


Problem B-9-3 Obtain the Hamilton-Jacobi equation of classical 
mechanics from the principle of optimality. Hamilton’s principle states that a 
particle moves so as to minimize the Lagrangian 


t 
[Leaner 


where q(t) is chosen so as to transform state Q given at time fp into state q at timer 
while minimizing the Lagrangian. [L is the difference between T and V, or L = 
T — V, where T is the kinetic energy and V is the potential energy of the system.] 


Problem B-9-4 Given the control system defined by 
Ý 0 1l fx 0 
Maie ML. 
Xs 0 -—1 x2 1 
and the performance index 


Xe, T) = ¥(T)x(T) +) f : u (wt) dt 


where c = x(0) and X is a positive constant, obtain the optimal control function 
uy and the minimized performance index, min J(c, zy 
" u 


Problem B-9-5 Consider the plant defined by 
ý 0 1 0 
JE EA 
Xs 0 —-1jix 1 


Assuming the input &,(t) to the control system, which has a similar configuration 
as that of Fig. 9-17, is a class of ramp input, and the performance index is given by 


J(c, T) = z(T)Pz(T) + f á POT OL. 


where 
xı (£) io e 09 
ma ael. me] ^. i 
à & (n) =A 0 1 0 
E, (1) 0.—1..09 1 


determine the elements of matrices F(T — t) and H«(T — 1) so that the performance 
index is minimized. 
Problem B-9-6 ‘Consider the system governed by the following 


xi 0 =A 1 x 0 
= d [u] 
X: 2 0 0 X2 1 
and the performance index 


T T 
He, T) = |? txat) — xOYxe() — x(0) dt + x f, w Oul) dt 


equation: 


where 
x(r) = jen — actual state vector 
x(t) 
| €i | m 
c= = 
C2 x«(0) 
— e-t 
Xa(t) = lee" — desired state vector — i iE 
Xoa(t) et | 


Define 
f(c, T) = min J(c, T) 


By assuming f(c, T) in the following form: 
f(e, T) = k(T) — 2IY(T)ei — 2k(T)es + ku(T)et + 2ki(T)cic2 + ko( T)e$ 


find the optimal control function u, and the minimized performance index f (c, T). 
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